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METHODS FOR DETECTING MARKERS ASSOCIATED WITH ENDOMETRIAL DISEASE OR PHASE 

FIELD OF THE INVENTION 

The invention relates to methods for identifying markers associated with endometrium, endometrial 
markers, methods for assessing the status of an endometrial tissue, and methods for the detection, diagnosis, 
5 prediction, and therapy of an endometrial disease. 
BACKGROUND OF THE INVENTION 

The endometrium, the tissue lining the uterus, is a glandular layer of variable thickness that is very 
sensitive to the hormones estrogen and progesterone. During the menstrual cycle the endometrium undergoes 

« 

cyclic variation with a proliferation phase where the endometrium grows under the influence of estrogen, an 

1 0 ovulation phase where the endometrium is exposed to estrogen and progesterone, and a secretory or progesterone 
dominated phase where the endometrium shows signs of increased gland growth and secretion due largely to the 
influence of progesterone [1]. The secretory phase is followed by the shedding of the endometrium during 
menstruation. The histologic changes in the endometrium have been used to detect the stage or status of the 
endometrium, which is important for example, in determining the receptivity of patients to fertility procedures, 

15 and in determining responses to estrogen and/or progesterone therapies. 

Endometrial carcinoma is a common malignancy in women, being exceeded in incidence only by that of 
breast, lung, and colorectal cancers [10, 11, 48, 125-131]. The lifetime probability of a Canadian woman 
developing endometrial carcinoma is 2.2% [ 10]. Although the case-fatality rate for cancer of the endometrium is 
lower than that of many other cancer sites, this rate does not fully reflect the health care burden posed by 

20 endometrial carcinoma. Investigation of women with perimenopausal and postmenopausal bleeding for the 
presence of endometrial carcinoma is one of the most common gynecologic investigations and requires invasive 
endometrial sampling. Yet only a small proportion of these investigations will result in a diagnosis of 
endometrial carcinoma [12]. 

At present, no methods for screening or early detection of 'endometrial carcinoma are available nor are 

25 there any serum tumor markers available for the monitoring of endometrial carcinoma patients [13, 14]. 
Consequently, patients are diagnosed following the development of symptoms, and patients with recurrent 
disease are detected only following the development of recurrent symptoms, or abnormalities in imaging 
assessments. Sensitive and specific tumor marker(s) for endometrial carcinoma are urgently needed for screening 
and diagnosis. There is also a need for reliable endometrial markers for determining the stage/phase, and status of 

30 the endometrium. 

SUMMARY OF THE INVENTION 

Applicants have developed a method for identifjdng markers associated with the endometrium, and in 
particular with proliferative endometirum, secretory endometrium, and diseased endometrial tissue. Using the 
method they analyzed normal endometrial tissue homogenates and endometrial tumor tissue homogenates, and 

35 identified novel endometrial markers, in particular markers of secretory and proliferative endometrium and 
endometrial cancer markers. 

The invention relates to a method of characterizing or classifying a sample of endometrium by detecting 
or quantitating in the sample one or more polypeptides extracted from the sample that are characteristic of 
endometrium, a phase thereof or an endometrial disease, the method comprising assaying for differential 
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expression of polypeptides in the sample. Differential expression of polypeptides can be assayed using 
separation techniques known in the art, an antibody microarray, or mass spectroscopy of polypeptides extracted 
from the sample. The invention includes polypeptides identified using a method of the invention. 

In an aspect the invention provides a method of characterizing a sample of endometrium by detecting or 
5 quantitating in the sample one or more polypeptides extracted jfrom the sample that are characteristic of a 
proliferative, secretory or an endometrial disease the method comprising assaying for differential expression of 
proteins in the sample by mass spectroscopy of proteins extracted from the sample. In an embodiment, 
differential expression of the proteins is carried out using surface enhanced laser desorption/ionization (SELDI- 
TOF MS). 

10 In an embodiment, the invention provides a method for identifying markers associated with the 

endometrium or a phase thereof, or associated with an endometrial disease comprising: 

(a) obtaining a sample of endometrium from a subject; 

(b) extracting proteins from the sample and producing a profile of the proteins by subjecting the 
proteins to mass spectrometry; and 

15 (c) comparing the profile with a profile for normal endometrial tissue or for a known phase of 

endometrium to identify proteins associated with an endometrial disease or with the 
endometrium phase or. 

In another aspect the invention is directed to bioinformatic methods for analyzing differential expression 
data generated from the methods of the invention to identify further markers associated with the endometrium or 
20 phase thereof, or associated with endometrial disease. 

The invention relates to novel markers for the endometrium, and in particular markers of an endometrial 
disease, and compositions comprising same. 

The invention provides marker sets that distinguish the endometirum or phases thereof, or endometrial 
diseases, and uses therefor. A marker set may comprise a plurality of polypeptides and/or polynucleotides 
25 encoding such polypeptides comprising or consisting of at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, or 15 of 
the markers listed in Table 1, 4, 5, or 6 . In specific aspects, the markers consist of at least 5, 6, 7, 8, 9, or 10 
polypeptides listed in Table 1, 4, 5, or 6. In an aspect the protein marker sets comprise or consist of protein 
clusters, or proteins in pathways comprising markers listed in Table 1, 4, 5, and 6. 

In embodiments of the invention, a marker is provided which is selected from the group consisting of 
30 the polypeptides set forth in Table 1, which polypeptides are up-regulated biomarkers in endometrial cancer. 

In embodiments of the invention, a marker is provided which is selected from the group consisting of 
the polypeptides set forth in Table 1, which polypeptides are down-regulated biomarkers in endometrial cancer. 

The markers identified in accordance with a method of the invention, in particular the markers identified 
in Table 1, 4, 5, or 6, including but not limited to native-sequence polypeptides, isoforms, chimeric polypeptides, 
35 all homologs, fragments, and precursors of the markers, including modified forms of tlie polypeptides and 
derivatives are referred to herein as "endometrial marker(s)". Polynucleotides encoding endometrial markers are 
referred to herein as "endometrial polynucleotide marker(s)", "polynucleotides encoding endometrial markers", 
or "polynucleotides encoding the marker(s)". The endometrial markers and endometrial polynucleotide markers 
are sometimes collectively referred to herein as "marker(s)". Markers of endometrial cancer are referred to herein 
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as "endometrial cancer markers", "endometrial cancer polynucleotide markers", and "polynucleotides encoding 
endometrial cancer markers". 

Endometrial markers identified in accordance with a method of the invention, (including the 
endometrial cancer markers listed in Table 1, 4, 5, or 6), and polynucleotides encoding the markers, have 
5 application in the determination of the status or phase of the endometrium and in the detection of an endometrial 
disease such as endometrial cancer. Thus, the markers can be used for diagnosis, monitoring (i.e. monitoring 
progression or therapeutic treatment), prognosis, treatment, or classification of an endometrial disease (e.g. 
endometrial cancer), or as markers before surgery or after relapse. The invention also contemplates methods for 
assessing the status of an endometrial tissue, and methods for the diagnosis and therapy of an endometrial 
10 disease. 

The markers characteristic of different stages or phases of endometrium identified by a method of the 
invention may be used to identify the physiologic stage or phase of the endometrium within the physiologic 
cycle. In an aspect, the endometrial markers may be used to assess and manage reproductive disorders and 
infertility. In particular, endometrial markers associated with the secretory phase or proliferative phase identified 

15 by a method of the invention may be used to determine if an endometrium is at the optimum stage or phase for 
embryo implantation. In an embodiment, the endometrial markers are characteristic of the secretory phase, and 
include the markers glutamate receptor subunit zeta 1 [SEQ ID NO. 26] or a tryptic fragment thereof [e.g. SEQ 
ID NO. 28], macrophage migration inhibitory factor [SEQ ID NO. 1,8], FRATl [SEQ ID NO. 42], myosin light 
chain Idnase 2 [SEQ ID NO. 45], and tropomyosin 1 alpha chain [SEQ ID NO. 47], and polynucleotides 

20 encoding the polypeptides. 

In accordance with methods of the invention, endometrium can be assessed or characterized, for 
example, by detecting the presence in the sample of (a) an endometrial marker or fragment thereof; (b) a 
metabolite which is produced directly or indirectly by an endometrial marker; (c) a transcribed nucleic acid or 
fragment thereof having at least a portion with which an endometrial polynucleotide marker is substantially 

25 identical; and/or (c) a transcribed nucleic acid or fragment thereof, wherein the nucleic acid hybridizes with an 
endometrial polynucleotide marker. 

The levels of endometrial markers or endometrial polynucleotide markers in a sample may be 
determined by methods as described herein and generally known in the art. The expression levels may be 
determined by isolating and determining the level of nucleic acid transcribed from each endometrial 

30 polynucleotide. Alternatively or additionally, the levels of endometrial markers translated from mRNA 
transcribed from an endometrial polynucleotide marker may be determined. 

In an aspect, the invention provides a method for characterizing or classifying an endometrial sample 
comprising detecting a difference in the expression of a first plurality of endometrial markers or endometrial 
polynucleotide markers relative to a control, the first plurality of markers consisting of at least 2, 3, 4, 5, 6, 7, 8, 

35 9, 10, 1 1, 12, 13, 14, or 15 of the markers corresponding to the markers listed in Table 1, 4, 5, or 6. In specific 
aspects, the plurality of markers consists of at least 5 of the markers listed in Table 1, 4, 5, or 6, 

In an aspect, a method is provided for characterizing an endometrium by detecting endometrial markers 
or endometrial polynucleotide markers associated with an endometrium stage or phase, or endometrial disease in 
a patient comprising: 
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(a) obtaining a sample from a subject; 

(b) detecting or identifying in the sample endometrial markers or endometrial polynucleotide 
markers; and 

(c) comparing the detected amount with an amount detected for a standard. 

5 In an embodiment of the invention, a method is provided for detecting endometrial cancer niarkers or 

endometrial cancer polynucleotide markers associated wilh endometrial cancer in a patient comprising: 

(a) obtaining a sample from a patient; 

(b) detecting in the sample endometrial cancer markers or endometrial cancer polynucleotide 
markers; and 

10 (c) comparing the detected amount with an amount detected for a standard. 

The term "detecf or "detecting" includes assaying, imaging or otherwise establishing the presence or 
absence of the target endometrial markers or polynucleotides encoding the markers, subunits thereof, or 
combinations of reagent bound targets, and the like, or assaying for, imaging, ascertaining, establishing, or 
otherwise determining one or more factual characteristics of an endometrium phase or endometrial disease 
15 including cancer, metastasis, stage, or similar conditions. The term encompasses diagnostic, prognostic, and 
monitoring applications for the endometrial markers and polynucleotides encoding the markers. 

The invention also provides a method of assessing whether a patient is afflicted with or has a pre- 
disposition for endometrial disease, in particular endometrial cancer, the method comprising comparing: 

(a) levels of endometrial markers or polynucleotides encoding endometrial markers associated 
20 with the endometrial disease in a sample from the patient; and 

(b) normal levels of endometrial markers or polynucleotides encoding endometrial markers 
associated with the endometrial disease in samples of the same type obtained firom control 
patients not afElicted with the disease, wherein altered levels of the endometrial markers or the 
polynucleotides relative to the corresponding normal levels of endometrial markers or 

25 polynucleotides is an indication that the patient is afflicted with endometrial disease. 

In an aspect of a method of the invention for assessing whether a patient is afflicted with or has a pre- 
disposition for endometrial cancer, higher levels of endometrial cancer markers in a sample relative to tlie 
corresponding normal levels is an indication that the patient is afflicted with endometrial cancer. 

In another aspect of a method of the invention for assessing whether a patient is afflicted with or has a 
30 pre-disposition for endometrial cancer, lower levels of endometrial caiicer markers in a sample relative to the 
corresponding normal levels is an indication that the patient is afflicted with endometrial cancer. 

In a further aspect, a method for screening a subject for endometrial disease is provided comprising (a) 
obtaining a biological sample from a subject; (b) detecting the amount of endometrial markers associated with 
the disease in said sample; and (c) comparing said amount of endometrial markers detected to a predetermined 
35 standard, where detection of a level of endometrial markers that differs significantly from the standard indicates 
endometrial disease. 

In an embodiment, a significant difference between the levels of endometrial marker levels in a patient 
and normal levels is an indication that the patient is afflicted with or has a predisposition to endometrial disease. 
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In a particular embodiment the amount of endometrial marker(s) detected is greater than that of a 
standard and is indicative of endometrial disease, in particular endometrial cancer. In another particular 
embodiment the amount of endometrial marker(s) detected is lower than that of a standard and is indicative of 
endometrial disease, in particular endometrial cancer. 
5 In aspects of the methods of the invention, the methods are non-invasive for detecting endometrium 

phase or endometrial disease which in tum allow for diagnosis of a variety of conditions or diseases associated 
with the endometrium. 

In particular, the invention provides a non-invasive non-surgical method for detection, diagnosis or 
prediction of endometrial disease in a subject comprising: obtaining a sample of blood, plasma, serum, urine or 
10 saliva or a tissue sample from the subject; subjecting the sample to a procedure to detect endometrial markers or 
endometrial polynucleotide markers in the blood, plasma, serum, urine, saliva or tissue; detecting, diagnosing, 
and predicting endometrial disease by comparing the levels of endometrial markers or endometrial 
polynucleotide markers to the levels of marker{s) or polynucleotide(s) obtained from a control subject witli no 
endometrial disease. 

15 In an embodiment, endometrial disease is detected, diagnosed, or predicted by determination of 

increased levels of markers (e.g Table 1 up-regulated markers) when compared to such levels obtained from the 
control. 

In another embodiment, endometrial disease is detected, diagnosed, or predicted by determination of 
decreased levels of markers (e.g. Table 1 down-regulated markers) when compared to such levels obtained from 
20 the control. 

The invention also provides a method for assessing the aggressiveness or indolence of an endometrial 
disease in particular cancer (e.g. staging), the method comprising comparing: 

(a) levels of endometrial markers or polynucleotides encoding endometrial markers associated 
with the endometrial disease in a patient sample; and 
25 (b) normal levels of the endometrial markers or the polynucleotides in a control sample. 

In an embodiment, a significant difference between the levels in the sample and the normal levels is an 
indication that the endometrial disease, in particular cancer, is aggressive or indolent In a particular embodiment, 
the levels of endometrial markers are higher than normal levels. In another particular embodiment, the levels of 
endometrial markers are lower than normal levels. 
30 In an aspect, the invention provides a method for determining whether a cancer has metastasized or is 

likely to metastasize in the friture, the method comprising comparing: 

(a) levels of endometrial cancer markers or polynucleotides encoding endometrial cancer markers 
in a patient sample; and 

(b) normal levels (or non-metastatic levels) of the endometrial cancer markers or polynucleotides 
35 in a control sample. 

In an embodiment, a significant difference between the levels in the patient sample and the normal 
levels is an indication that the cancer has metastasized or is likely to metastasize in the fixture. 

In another aspect, the invention provides a method for monitoring the progression of endometrial 
disease, in particular endometrial cancer in a patient the method comprising: 
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(a) detecting endometrial markers or polynucleotides encoding the markers associated with the 
disease in a sample from the patient at a first time point; 

(b) repeating step (a) at a subsequent point in time; and 

(c) comparing the levels detected in (a) and (b), and therefrom monitoring the progression of the 
5 endometrial disease. 

The invention contemplates a method for determining the effect of an environmental factor on the 
endometrium or phase thereof, or endometrial disease comprising comparing endometrial polynucleotide 
markers or endometrial markers in the presence and absence of the environmental factor. 

.The invention also provides a method for assessing the potential efficacy of a test agent for inliibiting 
10 endometrial disease, and a method of selecting an agent for inhibiting endometrial disease. 

The invention contemplates a method of assessing the potential of a test compound to contribute to an 
endometrial disease comprising: 

(a) maintaining separate aliquots of endometrial diseased cells in the presence and absence of the 
test compound; and 

15 (b) comparing the levels of endometrial markers or polynucleotides encoding the markers 

associated with the disease in each of the aliquots. 
A significant difference between the levels of endometrial markers or polynucleotides encoding the 
markers in an aliquot maintained in the presence of (or exposed to) the test compound relative to the aliquot 
maintained in the absence of the test compound, indicates that the test compound potentially contributes to 
20 endometrial disease. 

The invention further relates to a method of assessing the efficacy of a therapy for inhibiting 
endometrial disease in a patient. A method of the invention comprises comparing: (a) levels of endometrial 
markers or polynucleotides encoding the markers associated with disease in a first sample from the patient 
obtained from the patient prior to providing at least a portion of the therapy to the patient; and (b) levels of 
25 endometrial markers or polynucleotides encoding the markers associated with disease in a second sample 
obtained from the patient fr)llowing therapy. 

In an embodiment, a significant difference between the levels of endometrial markers or polynucleotides 
encoding the markers in the second sample relative to the first sample is an indication that the therapy is 
efficacious for inhibiting endometrial disease. 
30 hi a particular embodiment, the method is used to assess liie efficacy of a therapy for inhibiting 

endometrial disease (e.g. endometrial cancer), where lower levels of endometrial markers or polynucleotides 
encoding same in the second sample relative to the first sample, is an indication that the therapy is efficacious for 
inhibiting the disease. . 

The "therapy" may be any therapy for treating endometrial disease, in particular endometrial cancer, 
35 including but not Umited to therapeutics, radiation, immunotherapy, gene therapy, and surgical removal of tissue. 
Therefore, the method can be used to evaluate a patient before, during, and after therapy. 

Certain methods of the invention employ binding agents (e.g. antibodies) that specifically recognize 
endometrial markers. 
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In an embodiment, the invention provides methods for determining the presence or absence of 
endometrial disease, in particular endometrial cancer, in a patient, comprising the steps of (a) contacting a 
biological sample obtained from a patient with one or more binding agent that specifically binds to one or more 
endometrial markers associated with the disease; and (b) detecting in the sample an amount of marker that binds 
5 to the binding agent, relative to a predetermined standard or cut-off value, and therefrom determining the 
presence or absence of endometrial disease in the patient. 

In another embodiment, the invention relates to a method for.diagnosing and monitoring an endometrial 
disease, in particular endometrial cancer, in a subject by quantitating one or more endometrial markers associated 
with the disease in a biological sample from the subject comprising (a) reacting the biological sample with one or 
10 more binding agent specific for the endometrial markers (e.g. an antibody) that are directly or indirectly labelled 
with a detectable substance; and (b) detecting the detectable substance. 

In another aspect the invention provides a method for using an antibody to detect expression of one or 
more endometrial marker in a sample, the method comprising: (a) combining antibodies specific for one or more 
endometrial marker with a sample under conditions which allow the formation of antibodyimarker complexes; 
15 and (b) detecting complex formation, wherein complex formation indicates expression of the marker in the 
sample. Expression may be compared with standards and is diagnostic of an endometrial disease, in particular 
endometrial cancer. 

Embodiments of the methods of the invention involve (a) reacting a biological sample from a subject 
with antibodies specific for one or more endometrial markers which are directly or indirectly labelled with an 
20 enzyme; (b) adding a substrate for the enzyme wherein the substrate is selected so that the substrate, or a reaction 
product of the enzyme and substrate forms fluorescent complexes; (c) quantitating one or more endometrial 
markers in the sample by measuring fluorescence of the fluorescent complexes; and (d) comparing the 
quantitated levels to levels obtained for other samples from the subject patient, or control subjects. 

In another embodiment the quantitated levels are compared to levels quantitated for control subjects 
25 (e.g. normal or benign) without an endometrial disease {e.g. cancer) wherein an increase in endometrial marker 
levels compared with the control subjects is indicative of endometrial disease. 

In a further embodiment the quantitated levels are compared to levels quantitated for control subjects 
(e.g. normal or benign) without an endometrial disease (e.g. cancer) wherein a decrease in endometrial marker 
levels compared with the control subjects is indicative of endometrial disease. 
30 A particular embodiment of the invention comprises the following steps 

(a) incubating a biological sample with first antibodies specific for one or more endometrial 
cancer markers which are directly or indirectly labeled with a detectable substance, and second 
antibodies specific for one or more endometrial cancer markers which are immobilized; 

(b) detecting the detectable substance thereby quantitating endometrial cancer markers in the 
35 biological sample; and 

(c) comparing the quantitated endometrial cancer markers with levels for a predetermined 
standard. 

The standard may correspond to levels quantitated for samples from control subjects without 
endometrial cancer (normal or benign), with a different disease stage, or from other samples of the subject In an 
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embodiment, increased levels of endometrial cancer markers as compared to the standard may be indicative of 
endometrial cancer. In another embodiment, lower levels of endometrial cancer markers as compared to a 
standard may be indicative of endometrial cancer. 

Endometrial marker levels can be determined by constructing an antibody microarray in which binding 
5 sites comprise immobilized, preferably monoclonal, antibodies specific to a substantial fraction of marker- 
derived endometrial marker proteins of interest. 

Other methods of the invention employ one or more polynucleotides capable of hybridizing to one or 
more polynucleotides encoding endometrial markers. Thus, methods for detecting endometrial markers can be 
used to monitor an endometrial disease (e.g. cancer) by detecting endometrial polynucleotide markers associated 
10 with the disease. 

Thus, the present invention relates to a method for diagnosing and monitoring an endometrial disease 
(e.g. endometrial cancer) in a sample jfrom a subject comprising isolating nucleic acids, preferably mRNA, from 
the sample; and detecting endometrial marker polynucleotides associated with the disease in the sample. The 
presence of different levels of endometrial marker polynucleotides in the sample compared to a standard or 

15 control may be indicative of endometrium phase, disease, disease stage, and/or a positive prognosis i.e. longer 
progression-free and overall survival. 

In embodiments of the invention, endometrial cancer marker polynucleotide positive tumors (e.g. higher 
levels of the polynucleotides compared to a control normal or benign tissue) are a negative diagnostic indicator. 
Positive tumors can be indicative of endometrial cancer, advanced stage disease, lower progression-free survival, 

20 and/or overall survival. 

In other embodiments of the invention, endometrial cancer marker polynucleotide negative tumors (e.g. 
lower levels of the polynucleotides compared to a control normal or benign tissue) are a negative diagnostic 
indicator. Negative tumors can be indicative of endometrial cancer, advanced stage disease, lower progression- 
free survival, and/or overall survival. 

25 The invention provides methods for determining the presence or absence of an endometrial disease in a 

subject comprising detecting in the sample levels of nucleic acids that hybridize to one or more polynucleotides 
encoding endometrial markers associated with the disease, comparing the levels with a predetermined standard or 
cut-off value, and therefrom determining the presence or absence of endometrial disease in the subject. In an 
embodiment, the invention provides methods for determining the presence or absence of endometrial cancer in a 

30 subject comprising (a) contacting a sample obtained from the subject with oligonucleotides that hybridize to one 
or more polynucleotides encoding endometrial cancer markers; and (b) detecting in the sample a level of nucleic 
acids that hybridize to the polynucleotides relative to a predetermined cut-off value, and therefrom determining 
the presence or absence of endometrial cancer in the subject. 

Within certain embodiments, the amount of polynucleotides that are mRNA are detected via polymerase 

35 chain reaction using, for example, oligonucleotide primers that hybridize to one or more polynucleotides 
encoding endometrial markers, or complements of such polynucleotides. Within other embodiments, the amount 
of mRNA is detected using a hybridization technique, employing oligonucleotide probes that hybridize to one or 
more polynucleotides encoding endometrial markers, or complements thereof. 
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When using mRNA detection, the method may be carried out by combining isolated mRNA with 
reagents to convert to cDNA according to standard methods; treating the converted cDNA with amplification 
reaction reagents (such as cDNA PGR reaction reagents) in a container along with an appropriate mixture of 
nucleic acid primers; reacting the contents of the container to produce amplification products; and analyzing the 
5 amplification products to detect the presence of one or more endometrial polynucleotide markers in the sample. 
For mRNA the analyzing step may be accomplished using Northern Blot analysis to detect the presence of 
endometrial polynucleotide markers. The analysis step may be further accomplished by quantitatively detecting 
the presence of endometrial polynucleotide markers in the amplification product, and comparing the quantity of 
marker detected against a panel of expected values for the known presence or absence of the markers in normal 
10 and malignant tissue derived using similar primers. 

Therefore, the invention provides a method wherein mRNA is detected by (a) isolating mRNA from a 
sample arid combining the mRNA with reagents to convert it to cDNA; (b) treating the converted cDNA with 
amplification reaction reagents and nucleic acid primers that hybridize to one or more endometrial 
polynucleotide markers to produce amplification products; (d) analyzing the amplijBcation products to detect an 
15 amount of mRNA encoding the endometrial markers; and (e) comparing the amount of mRNA to an amount 
detected against a panel of expected values for normal and diseased tissue (e.g. malignant tissue) derived using 
similar nucleic acid primers. 

In particular embodiments of the invention, the methods described herein utilize the endometrial 
polynucleotide markers placed on a microarray so that the expression status of each of the markers is assessed 
20 simultaneously. 

In a particular aspect, the invention provides an endometrial microarray comprising a defined set of 
genes whose expression is significantly altered by endometrium phase or endometrial disease. The invention 
further relates to the use of the microarray as a prognostic tool to predict endometrium phase or endometrial 
disease. In an embodiment, the endometrial microarray discriminates between endometrial disease resulting from 
25 different etiologies. 

In an embodiment, the invention provides for oligonucleotide arrays comprising marker sets described 
herein. The microarrays provided by the present invention may comprise probes to markers able to distinguish 
endometrium phase or disease. In particular, the invention provides oligonucleotide arrays comprising probes to 
a subset or subsets of at least 5 or 10 gene markers up to a full set of markers which distinguish endometrium 
30 phase or endometrial disease. 

The invention also contemplates a method comprising administering to cells or tissues imaging agents 
that carry labels for imaging and bind to endometrial markers and optionally other markers of an endometrial 
disease, and then imaging the cells or tissues. 

In an aspect the invention provides an in vivo method comprising administering to a subject an agent 
35 that has been constructed to target one or more endometrial markers. 

In a particular embodiment, the invention contemplates an in vivo method comprising administering to a 
mammal one or more agent that carries a label for imaging and binds to one or more endometrial marker, and 
then imaging the mammal. 
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According to a particular aspect of the invention, an in vivo method for imaging endometrial cancer is 
provided comprising: 

(a) injecting a patient with an agent that binds to one or more endometrial cancer marker, the 
agent carrying a label for imaging the endometrial cancer; 
5 (b) allowing the agent to incubate in vivo and bind to one or more endometrial cancer marker 

associated with the endometrial cancer; and 
(c) detecting the presence of the label localized to the endometrial cancer. 
In an embodiment of the invention the agent is an antibody which recognizes an endometrial cancer 
marker. In another embodiment of the invention the agent is a chemical entity which recognizes an endometrial 
10 cancer marker. 

An agent carries a label to image an endometrial marker and optionally other markers. Examples of 
labels useful for imaging are radiolabels, fluorescent labels (e.g fluorescein and rhodamine), nuclear magnetic 
resonance active labels, positron emitting isotopes detectable by a positron emission tomography ("PET") 
scanner, chemiluminescers such as luciferin, and enzymatic markers such as peroxidase or phosphatase. Short- 
15 range radiation emitters, such as isotopes detectable by short-range detector probes can also be employed. 

The invention also contemplates the localization or imaging methods described herein using multiple 
markers for an endometrial disease (e.g. endometrial cancer). 

The invention also relates to kits for carrying out the methods of the invention. In an embodiment, a kit 
is for assessing whether a patient is afflicted with an endometrial disease (e.g. endometrial cancer) and it 
20 comprises reagents for assessing one or more endometrial markers or polynucleotides encoding the markers. 

The invention further provides kits comprising marker sets described herein. In an aspect the kit 
contains a microarray ready for hybridization to target endometrial oligonucleotide markers, plus software for the 
data analyses. 

The invention also provides a diagnostic composition comprising an endometrial marker or a 
25 polynucleotide encoding the marker. A composition is also provided comprising a probe that specifically 
hybridizes to endometrial polynucleotide markers, or a fragment thereof, or an antibody speciiSc for endometrial 
markers or a fragment thereof. In another aspect, a composition is provided comprising one or more endometrial 
polynucleotide marker specific primer pairs capable of amplifying the polynucleotides using polymerase chain 
reaction methodologies. The probes, primers or antibodies can be labeled with a detectable substance. 
30 Still further the invention relates to therapeutic applications for endometrial diseases, in particular 

endometrial cancer, employing endometrial markers and polynucleotides encoding the markers, and/or binding 
agents for the markers. 

In an aspect, the invention relates to compositions comprising markers or parts thereof associated with 
an endometrial disease, or antibodies specific for endometrial markers associated with an endometrial disease, 
35 and a pharmaceutically acceptable carrier, excipient, or diluent. A method for treating or preventing an 
endometrial disease, in particular endometrial cancer, in a patient is also provided comprising administering to a 
patient in need thereof, markers or parts thereof associated with an endometrial disease, antibodies specific for 
endometrial markers associated with an endometrial disease, or a composition of the invention. In an aspect the 
invention provides a method of treating a patient afflicted with or at risk of developing an endometrial disease 



wo 2005/061725 



PCT/CA2004/002172 



-11- 

(e.g. endometrial cancer) comprising inhibiting expression of endometrial markers. 

In an aspect, the invention provides antibodies specific for endometrial markers associated with a 
disease (e.g. endometrial cancer) that can be used therapeutically to destroy or inhibit the disease ( e.g. the 
growth of endometrial cancer marker expressing cancer cells), or to block endometrial marker activity associated 
5 with a disease. In an aspect, endometrial cancer markers may be used in various immunotherapeutic methods to 
promote immune-mediated destruction or growth inhibition of tumors expressing endometrial cancer markers. 

The invention also contemplates a method of using endometrial markers or parts thereof, or antibodies 
specific for endometrial markers in the preparation or manufacture of a medicament for the prevention or 
treatment of an endometrial disease e.g. endometrial cancer. 
10 Another aspect of the invention is the use of endometrial markers, peptides derived therej5:om, or 

chemically produced (synthetic) peptides, or any combination of these molecules, for use in the preparation of 
vaccines to prevent an endometrial disease and/or to treat an endometrial disease. 

The invention contemplates vaccines for stimulating or enhancing in a subject to whom the vaccine is 
administered production of antibodies directed against one or more endometrial markers. 
1 5 The invention also provides a method for stimulating or enhancing in a subject production of antibodies 

directed against one or more endometrial marker. The method comprises administering to the subject a vaccine 
of the invention in a dose effective for stimulating or enhancing production of the antibodies. 

The invention further provides a method for treating, preventing, or delaying recurrence of an 
endometrial disease (e.g. endometrial cancer). The method comprises administering to the subject a vaccine of 
20 the invention in a dose effective for treating, preventing, or delaying recurrence of an endometrial disease (e.g. 
endometrial cancer). 

The invention contemplates the methods, compositions, and kits described herein using additional 
markers associated with an endometrial disease (e.g. endometrial cancer). The methods described herein maybe 
modified by including reagents to detect the additional markers, or polynucleotides for the markers. 

25 In particular, the invention contemplates the methods described herein using multiple markers for an 

endometrial cancer. Therefore, the invention contemplates a method for anaylzing a biological sample for the 
presence of endometrial cancer markers and polynucleotides encoding the markers, and other markers that are 
specific indicators of cancer, in particular endometrial cancer. The methods described herein may be modified by 
including reagents to detect the additional markers, or nucleic acids for the additional markers. 

30 In embodiments of the invention the methods, compositions and kits use one or more of the markers 

listed in Table 1, 4, 5, or 6.. In another embodiment, the method uses a panel of markers selected firom the 
markers listed in Table 1, 4, 5, or 6 in particular a panel comprising two or more of the markers in Table 1, 4, 5 
or 6. 

Other objects, features and advantages of the present invention will become apparent firom the following 
35 detailed description. It should be understood, however, that the detailed description and the specific examples 
while indicating preferred embodiments of the invention are given by way of illustration only, since various 
changes and modifications within the spirit and scope of the invention will become apparent to those skilled in 
the art firom this detailed description. 
DESCRIPTION OF THE DRAWINGS 



wo 2005/061725 



PCT/CA2004/002172 



- 12- 

The invention will now be described in relation to the drawings in which: 

Figure 1 shows a schematic representation of the ICAT procedure including labeling, digestion, SCX 
fractionation, affinity isolation, cleavage of the biotin moiety and LC-MS/MS analysis. 

Figure 2 shows histologic sections of human endometrium, stained with hematoxylin and eosin. A) 
5 proliferative endometrium, sample PRO 2, and B) secretory endometrium, sample SEC2. 

Figure 3 shows a NanoLC-MS TIC of SCX fraction 16, ICAT sample A. 

Figure 4 shows the distribution of proteins identified from the middle ten SCX fractions of all tliree 

samples. 

Figure 5 shows A) a mass spectral window of Sample A, SCX fraction 19, showing enhanced 
10 expression of glutamate receptor subunit zeta 1 precursor in the secretory endometrium. The line to the left of 
the isotope envelope of the heavy-labeled version of the protein denotes where the light-labeled version is 
expected. B) Resulting MS/MS spectra of the heavy labelled series showing sequence coverage. 

Figure 6 shows A) a mass spectral window of Sample A, SCX fraction 13, showing enhanced 
expression of MIF in the secretory endometrium. B) Resulting MS/MS spectra of the heavy labelled series 
15 sequence coverage. 

Figure 7 (a+b). STEP ONE Mass spectra obtained from five whole endometrial tissue homogeneates on 
ProteinchipWCX2 using (la) SELDI-TOF-MS, PBSHc, and (lb) QqTOF-MS, QSTARXL. Note, using either 
MS method, the three malignant cases (38, 39, 40) show a distinct protein peak, which is low in the two non- 
mahgnant cases [18, 19]. The QSTARXL MS determined the weight of this peak to be 10,843 Da. Figure 7 

F 

20 STEP TWO Outline of protein purification. The target protein eluted in one of the early fractions of the size 
exclusion HPLC, was concentrated by Proteospin and then subjected to MALDI and SDS-PAGE (see Figure 8). 

Figure 8 (A to E). STEP THREE (8A), Molecular weight verification of target protein after SEC. 
MALDI-TOF MS of samples from Proteospin was used to locate the fraction with the 10,843 Da wei^t of the 
target protein in a malignant case (40).The corresponding fraction from normal sample was used in parallel for 

25 all the following experiments. (8B) Molecular weight verification of the target protein fraction following DTT 
treatment. MALDI-TOF analysis of two samples indicates that the 10,843 Da peak remained in the malignant 
sample even after DTT treatment. (8C) SDS-PAGE analysis of the DTT treated fractions. The gel was stained by 
colloidal coomassie blue (Gel-Code, Pierce). The region containing the target protein was excised according to 
the molecular weight marker, and the mtact proteins were extracted. (8D) Tryptic peptide profiles of the SDS- 

30 PAGE protein extract. MALDI-TOF MS analysis identified six "unique" tryptic peptides in the malignant 
samples. Three of them with mass of 907.54, 1036.58, and 1529.82 Da matched with the tryptic peptides from 
chaperonin 10. The remaining three peaks, marked with *, were from Keratin, (insert) Amino acid sequence of 
chaperonin 10. The locations of the three peptides were underlined. (8E) Fragmentation of peptide by collision 
induced dissociation. The 1 529. 82 Da peptide was fragmented. A series of y ions confirmed that the 1529.82 Da 

35 peptide comes from chaperonin 10, 

Figure 9. Western blot analysis against chaperonin 10. The malignant endometrial cases (EmCa) 
demonstrate a higher expression of chaperonin 10 than non-malignant cases (Normal). The level of ERKl was 
used to demonstrate similar loading among all lanes. 
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Figure 10. Immunohistochemical staining for chaperonin 10. Note^anular c3^oplasmic positivity in 
several malignant endometrioid glands, while adjacent endometrial stroma and a portion of one benign 
endometrial gland (right) shows much reduced staining. 

Figure 1 1 shows mass spectral windows from isotope- coded affinity tag (ICAT) experiments for three 
5 pairs of endometrial cancer / normal samples, demonstrating the over expression of calgizzarin in the cancer 
samples. 

Figure 12: Number of peptides/ protein identified. The &st (front) series corresponds to results obtained 
from cICAT, the second series is from the first iTRAQ run, and the third (rearmost) series is from the second 
iTRAQ run. 

1 0 Figure 13 : Cytoplasmic actin sequencing and quantification using iTRAQ. A : MS/MS spectrum of the 

doubly protonated peptide, EITALAPSTMK [SEQ ID NO. 49], at 725.4 Th. Residues in black are sequenced 
from the b-ion series, while those in purple from the y-ion series. Residue 'J' in the y-ion series is the iTRAQ- 
modified lysine residue. B: Expanded view of the low-m/z end of the MS/MS spectrum in 'A showing relative 
abundances of the signature iTRAQ ions at 114.1, 115.1, 116.1 and 117.1 Th. 
15 Figure 14: Relative abundances of differentially expressed proteins. The ion assignments are as 

follows: 114 Th, proliferative endometrium; 115 Th, secretory endometrium; 116 Th, first endometrial 
carcinoma; and 1 17 Th, second endometrial carcinoma. A: Tryptic peptide from chaperonin 10, showing higher 
abundances in the 1 16 and 1 17 Th ions relative to the 114 and 1 15 Th ions; B: tryptic peptide from alpha- 1- 
antitrypsin precursor, showing lower abundances in the 116 and 1 17 Th ions relative to the 114 and 115 Th ions; 
20 C: tryptic peptide from creatine kinase B, showing lower abundances in the 1 16 and 1 17 Th ions relative to the 
1 14 and 1 15 Th ions; D: tryptic peptide from transgelin, showing once more lower abundances in the 116 and 
1 17 Th ions relative to the 1 14 and 1 15 Th ions. 

Figure 15 : Relative abundances of calgizzarin. cICAT analysis shows overexpression in all three cancer 
samples. The dotted line to the left of the heavy-label series, which originates from the cancerous sample, 
25 indicates the monoisotopic peak of the light-label series, which originates from the normal sample. 

Figure 16: Distribution of proteins identified by iTRAQ and B: cICAT analyses. 
Figure 17 shows histologic sections of human endometrium stained with hematoxylin and eosin: (a) 
proliferative endometrium, sample PR02, and (b) secretory endometrium, sample SEC2. 

Figure 18 is a nano LC/MS total ion chromatogram of strong cation exchange fraction 16 of sample A. 
30 Figure 19 shows the distribution of 1 1 9 confirmed proteins. The legend shows the protein categories 

starting with "Structural" at 18% at the 2 o'clock position and continuing clockwise. 

Figure 20 shows the appearance of selected proteins in the strong cation exchange fractions. 
Figure 21 shows the (a) mass spectral window of sample A, fraction 19, showing enhanced expression 
of glutamate receptor subunit zeta 1 precursor in the secretory endometrium. The line to the left of the isotope 
35 envelope of the heavy-labeled version of the protein denotes where the light-labeled version is expected; (b) 
resulting MS/MS spectrum showing a partial sequence for the peptide LLTLALLFSCS VAR [SEQ ID NO. 28]. 

Figure 22 shows the mass spectral windows of the three sample pairs, fraction 15, showing enlianced 
expression of FRATl. 
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Figure 23 shows the mass spectral windows of sample A, fraction 4, showing enhanced expression of 
glycodelin in the secretory endometrium. 

Figure 24 shows the mass spectral windows showing the isotopic clusters of cathepsin B. Sample A 
shows enhanced cathepsin B expression in the secretory endometrium; sample B shows the opposite trend. 
5 Figure 25 shows SELDI MS spectral windows acquired on the QqTOF mass spectrometer (QSTAR 

XL): (a) typical tissue homogenate; (b) homogenate treated with 0 % hydrogen peroxide; (c) homogenate in (b) 
treated incubated with 3% hydrogen peroxide overnight; (d) laser fiuence at 34.6 pJ; and (e) laser fluence at 76.9 
pj. 

Figure 26 shows SELDI MS spectra of four malignant (designated by "C") and two endometrial tissue 
10 homogenates (designated by "N"): (a) full scan spectra collected on the linear QTOF mass spectrometer (PBS 

lie); and (b) mass spectral windows collected on the PBS lie, left panel, and on the QqTOF mass spectrometer 

(QSTAR XL), right panel. 

Figure 27 shows SELDI MS spectral windows showing binding of both (a) chaperonin 10 and (b) the 

unknown protein at pH 6.0; and selective binding of the unlcnown protein at pH 7.0. 
15 Figure 28 shows MS/MS spectra of three tryptic peptides from the unknown protein that identify it as 

calgranulin A. Peptide identities: (a) MLTELEK [SEQ ID NO. 50], (b) ALNSIIDVYHK [SEQ ID NO. 5 1], and 

(c) GADVWFK [SEQ ID NO. 52]. 

Figure 29 shows Calgranulin A immunoliistochemical stain of endometrial TMA (US Biochem 1 : 150): 

(A) Endometrioid adenocarcinoma exhibits diffuse, immunostaining of numerous cells (3+) of the glandular 
20 component of the adenocarcinoma. Intense immunostaining of macrophages/granulocytes within the stroma and 

glandular lumina is also evident. No stromal staining is observed. (B) Endometrioid adenocarcinoma shows 

strong positive immunostaining of occasional malignant glands (1+). Intense individual cell staining is also 

apparent in macrophages/granulocytes within gland lumina and stroma. 

Figure 30 shows Calgranulin A immunohistochemical stain of endometrial TMA (US Biochem 1: 150): 
25 Note intense (3+) cytoplasmic and nuclear immunostaining in squamous areas of this endometrioid 

adenocarcinoma. Glandular or columnar areas of the adenocarcinoma and adjacent stroma do not demonstrate 

any staining in this particular case. 

Figure 3 1 shows Calgranulin A immunohistochemical stain of endometrial TMA (US Biochem 1:150): 

The glands and stroma of this proliferative endometrium show no immunostaining. However, intense individual 
30 macrophage/granulocyte inmiunostaining is evident within the stroma. ' 

DETAILED DESCRIPTION OF THE INVENTION 

Methods are provided for characterizing the stage or phase of endometrium, detecting the presence of an 

endometrial disease (e.g. endometrial cancer) in a sample, the absence of a disease (e.g. endometrial cancer) in a 

sample, the stage of a disease, the stage or grade of the disease, and other characteristics of endometrial diseases 
35 that are relevant to prevention, diagnosis, characterization, and therapy of endometrial diseases such as cancer in 

a patient, for example, the benign or malignant nature of an endometrial cancer, the metastatic potential of an 

endometrial cancer, assessing the histological type of neoplasm associated with an endometrial cancer, the 

indolence or aggressiveness of an endometrial cancer, and other characteristics of endometrial diseases that are 

relevant to prevention, diagnosis, characterization, and therapy of endometrial diseases such as cancer in a 
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patient. Methods are also provided for assessing the efficacy of one or more test agents for inhibiting an 
endometrial disease, assessing the efEicacy of a therapy for an endometrial disease, monitoring the progression of 
an endometrial disease, selecting an agent or therapy for inhibiting an endometrial disease, treating a patient 
afflicted with an endometrial disease, inhibiting an endometrial disease in a patient, and assessing the disease 
5 (e.g. carcinogenic) potential of a test compound. 
Glossary 

"Endometrial disease" refers to any disorder, disease, condition, syndrome or combination of 
manifestations or symptoms recognized or diagnosed as a disorder of the endometrium, including but not limited 
to hyperplasia and cancer precursors, endometrial cancer or carcinoma, endometriosis, reproductive disorders, 
10 and infertility. 

"Endometrial cancer" or "endometrial carcinoma" includes malignant endometrial disease including but 
not limited to endometrioid, mucinous, and serous adenocarcinomas, adenosquamous carcinomas, clear cell 
carcinomas, uterine sarcomas including stromal sarcomas, malignant mixed MuUerian tumors (carcinosarcomas), 
and leiomyosarcomas. 

15 The terms "sample", "biological sample", and the like mean a material known or suspected of 

expressing or containing one or more endometrial polynucleotide markers or one or more endometrial markers. 
A test sample can be used directly as obtained from the source or following a pretreatment to modify the 
character of the sample. The sample can be derived from any biological source, such as tissues, extracts, or cell 
cultures, including cells (e.g. tumor cells), cell lysates, and physiological fluids, such as, for example, whole 

20 blood, plasma, serum, saliva, ocular lens fluid, cerebral spinal fluid, sweat, urine, milk, ascites fluid, synovial 
fluid, peritoneal fluid, lavage fluid, and the like. The sample can be obtained from animals, preferably mammals, 
most preferably humans. The sample can be treated prior to use, such as preparing plasma from blood, diluting 
viscous fluids, and the like. Methods of treatment can involve filtration,' distillation, extraction, concentration, 
inactivation of interfering components, the addition of reagents, and the like. 

25 In embodiments of the invention the sample is a mammalian tissue sample. In a particular embodiment, 

the tissue is endometrial tissue. 

In another embodiment the sample is a human physiological fluid. In a particular embodiment, the 
sample is human serum. 

The samples that may be analyzed in accordance with the invention include polynucleotides from 
30 clinically relevant sources, preferably expressed RNA or a nucleic acid derived therefrom (cDNA or amplified 
RNA derived from cDNA that incorporates an RNA polymerase promoter). The target polynucleotides can 
comprise RNA, including, without limitation total cellular RNA, poly(A)'*" messenger RNA (mRNA) or fraction 
thereof, cytoplasmic mRNA, or RNA transcribed from cDNA (i.e., cRNA; see, for example., Linsley & Schelter, 
U.S. patent appUcation Ser. No. 09/41 1,074, or U.S. Pat Nos. 5,545,522, 5,891,636, or 5,716,785). Methods for 
35 preparing total and poly(A)"*" RNA are well known in the art, and are described generally, for example, in 
Sambrook et al., (1989, Molecular Cloning — A Laboratory Manual (2"^ Ed.), Vols. 1-3, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, N.Y.) and Ausubel et al, eds. (1994, Current Protocols in Moelcular Biology, 
vol. 2, Current Protocols Publishing, New York). RNA may be isolated from eukaryotic cells by procedures 
involving lysis of the cells and denaturation of the proteins contained in the cells. Additional steps may be 
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utilized to remove DNA. Cell lysis may be achieved with a nonionic detergent, followed by microcentrifligation 
to remove the nuclei and hence the bulk of the cellular DNA. (See Chirgwin et al., 1979, Biochemistry 18:5294- 
5299). Poly(A)+RNA can be selected using oligo-dT cellulose (see Sambrook et al, 19S9, Molecular Cloning— 
A Laboratory Manual {27idEd.\ Vols. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.). In the 
5 alternative, RNA can be separated from DNA by organic extraction, for example, with hot phenol or 
phenol/chloroform/isoamyl alcohol. 

It may be desirable to enrich mRNA with respect to other cellular RNAs, such as transfer RNA (tRNA) 
and ribosomal RNA (rRNA). Most mRNAs contain a poly(A) tail at their 3 ' end allowing them to be enriched by 
affinity chromatography, for example, using oligo(dT) or poly(U) coupled to a solid support, such as cellulose or 
10 SephadexTM (see Ausubel et al., eds., 1994, Current Protocols in Molecular Biology, vol. 2, Current Protocols 
Publishing, New York). Bound poly(A)+mRNA is eluted from the affinity column using 2 mM EDTA/0.1% 
SDS. 

A sample of RNA can comprise a plurality of different mRNA molecules each with a different 
nucleotide sequence. In an aspect of the invention, the mRNA molecules in the RNA sample comprise at least 
15 100 different nucleotide sequences. 

Target polynucleotides can be detectably labeled at one or more nucleotides using methods known in 
the art. The label is preferably uniformly incorporated along the length of the RNA, and more preferably, is 
carried out at a high degree of efficiency. The detectable label can be a luminescent label, fluorescent label, bio- 
luminescent label, chemi-luminescent label, radiolabel, and colorimetric label. In a particular embodiment, the 

20 label is a fluorescent label, such as a fluorescein, a phosphor, a rhodamine, or a polymethine dye derivative. 
Commercially available fluorescent labels include, for example, fluorescent phosphoramidites such as 
FluorePrime (Amersham Pharmacia, Piscataway, N.J.), Fluoredite (Millipore, Bedford, Mass.), FAM (ABI, 
Foster City, Calif.), and Cy3 or Cy5 (Amersham Pharmacia, Piscataway, N. J.). 

Target polynucleotides from a patient sample can be labeled differentially from polynucleotides of a 

25 standard. The standard can comprise target polynucleotides from normal individuals (i.e., those not afflicted with 
or pre-disposed to endometrikl disease, in particular pooled from samples from normal individuals. The target 
polynucleotides can be derived from the same individual, but taken at different time points, and thus indicate the 
efficacy of a treatment by a change in expression of the markers, or lack thereof, during and after the course of 
treatment. 

30 The terms "subject", "individual" or "patient" refer to a warm-blooded animal such as a mammal. In 

particular, the terms refer to a human. A subject, individual or patient may be afflicted witli or suspected of 
having or being pre-disposed to endometrial disease or a condition as described herein. The term also includes 
domestic animals bred for food or as pets, including horses, cows, sheep, poultry, fish, pigs, cats, dogs, and zoo 
animals. 

35 Methods herein for administering an agent or composition to subjects/individuals/patients contemplate 

treatment as well as prophylactic use. Typical subjects for treatment include persons susceptible to, suffering 
from or that have suffered a condition or disease described herein. In particular, suitable subjects for treatment in 
accordance with the invention are persons that are susceptible to, suffering from or that have suffered 
endometrial cancer. 
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The term "endometrial marker" includes a marker associated with normal or diseased endometrial tissue 
identified using a method of the invention. The term includes native-sequence polypeptides isoforms, chimeric 
polypeptides, complexes, all homologs, fragments, precursors, and modified forms and derivatives of the 
markers. 

5 An endometrial marker may be associated with a stage or phase of endometrial tissue such as the 

secretory or proliferative phase. Examples of endometrial markers associated with the secretory phase are 
glutamate receptor subunit zeta 1 [SBQ ID NO. 26] and parts thereof (e.g. a tryptic fragment of SEQ ID NO. 28), 
macrophage migration inhibitory factor [SEQ ID NO. 18], FRATl [SEQ ID NO. 42], myosin light chain Idnase 2 
[SEQ ID NO. 45], and tropomyosin 1 alpha chain [SEQ ID NO. 47]. 
10 An endometrial marker may be associated with an endometrial disease, in particular it may be an 

endometrial cancer marker. The term "endometrial cancer marker" includes a marker associated with endometrial 
cancer identified using a method of the invention, in particular a marker listed in Table 1. 

In an aspect of the invention, an endometrial cancer marker is chaperonin 10. The term "chaperonin 10", 
"chaperonin 10 polypeptide" or "chaperonin protein" includes human chaperonin 10, in particular the native- 

15 sequence polypeptide, isoforms, chimeric polypeptides, all homologs, fragments, precursors, complexes, and 
modified forms and derivatives of human chaperonin 10. The amino acid sequence for native human chaperonin 
10 includes the sequences of GenBank Accession No. Q04984 and AAH23518 shown in SEQ ID NO. 1. 

A "native-sequence polypeptide" comprises a polypeptide having the same amino acid sequence of a 
polypeptide derived from nature. Such native-sequence polypeptides can be isolated from nature or can be 

20 produced by recombinant or syntlietic means. The term specifically encompasses naturally occurring truncated or 
secreted forms of a polypeptide, polypeptide variants including naturally occurring variant forms (e.g. 
alternatively spliced forms or sphce variants), and naturally occurring allelic variants. 

The term "polypeptide variant" means a polypeptide having at least about 45%, 50%, 55%, 60%, 65%, 
70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% amino acid sequence identity, particularly at least abqut 

25 70- 80%, more particularly at least about 85 %, still more particularly at least about 90%, most particularly at least 
about 95% amino acid sequence identity with a native-sequence polypeptide. Particular polypeptide variants have 
at least 70-80%, 85%, 90%, 95% amino acid sequence identity to the sequences identified in Table 1 , 4, 5, or 6 
(see in particular, GenBank Accession Nos. Q04984 and AAH23518, P05109, P06702, P01833 or Q81ZY7, 
P30086, P39687, P17066, P14174, P31949, P00938 andNP_000356, Q05586, ITHUand P01009, gi/125294 and 

30 P12277, P14618, Q01995, Q14103, NP_852000, NP„444254, andP09493 and AAH07433 and SEQIDNOs. 1, 
3, 6, 9, 11, 13, 15, 18, 21, 23, 26, 30, 33, 36, 38, 40, 42, 45, and 47). Such variants include, for instance, 
polypeptides wherein one or more amino acid residues are added to, or deleted fi-om, the N- or C-terminus of the 
full-length or mature sequences of the polypeptide, including variants from other species, but excludes a native- 
sequence polypeptide. In aspects of the invention variants retain the immunogenic activity of the corresponding 

35 native-sequence polypeptide. 

Percent identity of two amino acid sequences, or of two nucleic acid sequences is defined as the 
percentage of amino acid residues or nucleotides in a candidate sequence that are identical with the amino acid 
residues in a polypeptide or nucleic acid sequence, after aligning the sequences and introducing gaps, if 
necessary, to achieve the maximum percent sequence identity, and not considering any conservative substitutions 
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as part of the sequence identity. Alignment for purposes of determining percent amino acid or nucleic acid 
sequence identity can be achieved in various conventional ways, for instance, using publicly available computer 
software including the GCG program package (Devereux J. et al., Nucleic Acids Research 12(1): 387, 1984); 
BLASTP, BLASTN, and FASTA (Atschul, S.R et al. J. Molec. Biol. 215: 403-410, 1990). The BLAST X 
5 program is publicly available from NCBI and other sources (BLAST Manual, Altschul, S. et al, NCBI NLM NTH 
Bethesda, Md. 20894; Altschul, S. et al, J. Mol. Biol. 215: 403-410, 1990). Skilled artisans can determine 
appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment 
over the full length of the sequences being compared. Methods to determine identity and similarity are codified 
in publicly available computer programs. 

10 An allelic variant may also be created by introducing substitutions, additions, or deletions into a 

polynucleotide encoding a native polypeptide sequence such that one or more amino acid substitutions, additions, 
or deletions are introduced into the encoded protein. Mutations maybe introduced by standard methods, such as 
site-directed mutagenesis and PCR-mediated mutagenesis. In an embodiment, conservative substitutions are 
made at one or more predicted non-essential amino acid residues. A "conservative amino acid substitution" is 

1 5 one in which an amino acid residue is replaced with an amino acid residue with a similar side chain. Amino acids 
with similar side chains are known in the art and include amino acids with basic side chains (e.g. Lys, Arg, His), 
acidic side chains (e.g. Asp, Glu), uncharged polar side chains (e.g. Gly, Asp, Glu, Ser, Thr, Tyr and Cys), 
nonpolar side chains (e.g. Ala, Val, Leu, Iso, Pro, Trp), beta-branched side chains (e.g. Thr, Val, Iso), and 
aromatic side chains (e.g. Tyr, Phe, Trp, His). Mutations can also be introduced randomly along part or all of the 

20 native sequence, for example, by saturation mutagenesis. Following mutagenesis the variant polypeptide can be 
recombinantly expressed and the activity of the polypeptide may be determined. 

Polypeptide variants include polypeptides comprising amino acid sequences sufficiently identical to or 
derived from the amino acid sequence of a native polypeptide which include fewer amino acids than the full 
length polypeptides. A portion of a polypeptide can be a polypeptide which is for example, 10, 15, 20, 25, 30, 35, 

25 40, 45, 50, 60, 70, 80, 90, 100 or more amino acids in length. Portions in which regions of a polypeptide are 
deleted can be prepared by recombinant techniques and can be evaluated for one or more functional activities 
such as the ability to form antibodies specific for a polypeptide. 

A naturally occurring allelic variant may contain conservative amino acid substitutions from the native 
polypeptide sequence or it may contain a substitution of an amino acid from a corresponding position in a 

30 polypeptide homolog, for example, a murine polypeptide. 

Endometrial markers include chimeric or fusion proteins. A "chimeric protein" or "fusion protein" 
comprises all or part (preferably biologically active) of an endometrial marker operably linked to a heterologous 
polypeptide (i.e., a polypeptide other than an endometrial marker). Within the fusion protein, the term "operably 
linked" is intended to indicate that an endometrial marker and the heterologous polj/peptide are fused in-frame to 

35 each other. The heterologous polypeptide can be fused to the N-terminus or C-terminus of an endometrial 
marker, A useful fusion protein is a GST fusion protein in which an endometrial marker is fused to the C- 
terminus of GST sequences. Another example of a fusion protein is an immunoglobulin fusion protein in which 
all or part of an endometrial marker is fused to sequences derived from a member of the immunoglobulin protein 
family. Chimeric and fusion proteins can be produced by standard recombinant DNA techniques. 



■ 
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A modified form of a polypeptide referenced herein includes modified forms of the polypeptides and 
derivatives of the polypeptides, including post-translationally modified forms such as glycosylated, 
phosphorylated, acetylated, methylated or lapidated forms of the polypeptides. For example, an N-terminal 
methionine may be cleaved from a polypeptide, and a new N-terminal residue may or may not be acetylated. In 
5 particular, for chaperonin 10 the first residue, methionine, can be cleaved and the second first residue, alanine 
can be N-acetylated. 

Endometrial markers identified in accordance with a method of the invention may be prepared by 
recombinant or synthetic methods, or isolated from a variety of sources, or by any combination of these and 
similar techniques. 

10 "Endometrial polynucleotide marker(s)", polynucleotides encoding the marker(s)", and 

"polynucleotides encoding endometrial markers" refer to polynucleotides that encode endometrial markers 
including native-sequenjCe polypeptides, polypeptide variants including a portion of a polypeptide, an isoform, 
precursor, complex, a chimeric polypeptide, or modified forms and derivatives of the polypeptides. An 
endometrial polynucleotide marker includes the polynucleotides encoding the polypeptides listed in Table 1, 4, 5, 

15 or 6. In particular, the invention includes the polynucleotides encoding glutamate receptor subunit zeta 1 [e.g. see 
AccessionNo. D13515 and SEQ ID NO. 27], a tryptic peptide thereof [SEQ ID NO. 28], macrophage migration 
inhibitory factor [e.g. see Accession Nos. NM_002415 and L19686 and SEQ ID NOs. 19 and 20], FRATl [e.g. 
see Accession Nos. NM_005479 and NM_181355 and SEQ ID NOs. 43 and 44], myosin hght chain kinase 2 
[AccessionNo. AF069601 and SEQ ID NO. 46], tropomyosin I alpha chain e.g. see AccessionNo. NM_000366 

20 and BC007433 and SEQ ID NO. 48], and the endometrial markers listed in Table 1 [in particular, GenBank 

Accession Nos. NM_002157 and U07550, A12027, NM_002964, X06233, M21064, NM_002644, NM_002567, 

NM_006305, NM_002155, X51757, NM_002415, L19686, NM_005620 and D38583, N1VL000365, X69723, 

NM_000295,K02212,NM_001823,X15334,X56494,D84342, AF026126, andSEQ ID NOs. 2, 4,5,7, 8, 10, 

♦ 

12, H 16, 17, 19,20, 22,24,25,31,32, 34, 35, 37, 39,41]. In an embodiment, a polynucleotide of the invention 
25 encodes chaperonin 10, more particularly a polynucleotide sequence of GenBank Accession No. NM_002157 
and U07550 [SEQ ID NO 2], or a fragment thereof. 

Endometrial polynucleotide markers include complementary nucleic acid sequences, and nucleic acids 
that are substantially identical to these sequences (e.g. having at least about 45%, 50%, 55%, 60%, 65%, 70%, 
75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity). 
3 0 Endometrial polynucleotide markers also include sequences that differ from a native sequence [e.g. see 

GenBanlc Accession Nos. NM„002157 and U07550, A12027, NM„002964 , X06233, M21064, NM_002644, 
NM_002567, NM_006305, NM_002155, X51757, NM_0024I5, L19686, NM„005620 and D38583, 
NM_000365, X69723, D13515, NM_000295, K02212, NM_001823, X15334, X56494, D84342, AF026126, 
NM_005479, NM_18 1355, AF069601, NM_000366, and BC007433, and SEQ ID NOs. 2, 4, 5, 7, 8, 10, 12, 14, 
35 16, 17, 19, 20, 22, 24, 25, 27, 31, 32, 34, 35, 37, 39, 41, 43, 44, 46, and 48] due to degeneracy in the genetic 
code. As one example, DNA sequence polymorphisms within the nucleotide sequence of an endometrial marker 
may result in silent mutations that do not affect the amino acid sequence. Variations in one or more nucleotides 
may exist among individuals within a population due to natural allelic variation. DNA sequence polymorphisms 
may also occur which lead to changes in the amino acid sequence of a polypeptide. 
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Endometrial polynucleotide markers also include nucleic acids that hybridize under stringent conditions, 
preferably high stringency conditions to an endometrial polynucleotide marker, in particular an endometrial 
cancer polynucleotide marker [e.g. see GenBank Accession Nos, NM_002157 and U07550, A12027, 
NM_002964, X06233, M21064, NM_002644, NM_002567, NM_006305,NM„002155, X51757,NM_002415, 
5 L19686, NM_005620 and D38583, NM_000365, X69723, D13515, NM_000295, K02212, NM_001823, 
X15334, X56494, D84342, AF026126, NM_005479, NM_181355, AF069601, NM_000366, and BC007433, 
and SEQ ID NOs. 2, 4, 5, 7, 8, 10, 12, 14, 16, 17, 19, 20, 22, 24, 25, 27, 3 1, 32, 34, 35, 37, 39, 41, 43, 44, 46, and 
48] . Appropriate stringency conditions which promote DNA hybridization are known to those skilled in the art, 
or can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. For 
10 example, 6.0 x sodium chloride/sodium citrate (SSC) at about 45°C, followed by a wash of 2,0 x SSC at 50°C 
may be employed. The stringency may be selected based on the conditions used in the wash step. By way of 
example, the salt concentration in the wash step can be selected from a high stringency of about 0.2 x SSC at 
50°C. In addition, the temperature in the wash step can be at high stringency conditions, at about 65 °C. 

Endometrial polynucleotide markers also include truncated nucleic acids or nucleic acid fragments and 
15 variant forms of the nucleic acids that arise by altemative splicing of an mRNA corresponding to a DNA. 

The endometrial polynucleotide markers are intended to include DNA and RNA (e.g. mRNA) and can 
be either double stranded or single stranded. A polynucleotide may, but need not, include additional coding or 
non-coding sequences, or it may, but need not, be linked to other molecules and/or carrier or support materials. 
The polynucleotides for use in the methods of the invention may be of any length suitable for a particular 
20 method. In certain applications the term refers to antisense polynucleotides (e.g. mRNA or DNA strand in the 
reverse orientation to sense cancer polynucleotide markers). 

"Statistically different levels", "significantly altered levels", or "significant difference" in levels of 
markers in a patient sample compared to a control or standard (e.g. normal levels or levels in other samples from 
a patient) may represent levels that are higher or lower tlian the standard error of the detection assay. In particular 
25 embodiments, the levels may be 1.5, 2, 3, 4, 5, or 6 times higher or lower than the control or standard. 

"Microarray and "array," refer to nucleic acid or nucleotide arrays or protein or peptide arrays that can 
be used to detect biomolecules associated with endometrium or phase threof or endometrial disease, for instance 
to measure gene expression. A variety of arrays are made in research and manufacturing facilities worldwide, 
some of which are available commercially. Byway of example, spotted arrays and in situ synthesized arrays are 
30 two kinds of nucleic acid arrays that differ in the manner in which the nucleic acid materials are placed onto the 
array substrate. A widely used in situ synthesized oligonucleotide array is GeneChip™ made by Affymetrix, Inc. 
Oligonucleotide probes that are 20- or 25-base long can be synthesized in silico on the array substrate. These 
arrays can achieve high densities (e.g., more than 40,000 genes per cm^). Generally spotted arrays have lower 
densities, but the probes, typically partial cDNA molecules, are much longer than 20- or 25-mers. Examples of 
35 spotted cDNA arrays include LifeArray made by Incyte Genomics and DermArray made by IntegriDerm (or 
Invitrogen). Pre-synthesized and amplified cDNA sequences are attached to the substrate of spotted arrays. 
Protein and peptide arrays also are known (see for example, Zhu et al., Science 293:2101 (2001). 

"Binding agent" refers to a substance such as a polypeptide or antibody that specifically binds to one or 
more endometrial markers. A substance "specifically binds" to one or more endometrial markers if is reacts at a 
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detectable level with one or more endometrial markers, and does not react detectably with peptides containing an 
luirelated or different sequence. Binding properties may be assessed using an ELISA, which may be readily 
performed by those skilled in the art (see for example, Newton et al , Develop. Dynamics 197: 1-13, 1993). 

A binding agent may be a ribosome, with or without a peptide component, an aptamer, an RNA 
5 molecule, or a polypeptide. A binding agent may be a polypeptide that comprises one or more endometrial 
marker sequence, a peptide variant thereof, or a non-peptide mimetic of such a sequence. By way of example, a 
chaperonin 10 sequence may be a peptide portion of a chaperonin 10 that is capable of modulating a function 
mediated by chaperonin 10. 

An aptamer includes a DNA or RNA molecule that binds to nucleic acids and proteins. An aptamer that 
1 0 binds to a protein (or binding domain) of an endometrial marker or an endometrial polynucleotide marker can be 
produced using conventional techniques, without undue experimentation. (For example, see the following 
publications describing in vitro selection of aptamers: Klug et al., Mol. Biol. Reports 20:97-107 (1994); Wallis et 
aL, Chem. Biol. 2:543-552 (1995); Ellington, Curr. Biol. 4:427-429 (1994); Lato et al., Chem. Biol. 2:291-303 
(1995); Conrad et al., Mol. Div. 1:69-78 (1995); and Uphoffet al., Curr. Opin. Struct. Biol. 6:281-287 (1996)). 
15 Antibodies for use in the present invention include but are not limited to monoclonal or polyclonal 

antibodies, immunologically active fragments (e.g. a Fab or (Fab)2 fragments), antibody heavy chains, 
humanized antibodies, antibody light chains, genetically engineered single chain Fv molecules (Ladner et al, U.S. 
Pat. No. 4,946,778), chimeric antibodies, for example, antibodies which contain the binding specificity of murine 
antibodies, but in which the remaining portions are of human origin, or derivatives, such as enzyme conjugates or 
20 labeled derivatives. 

Antibodies including monoclonal and polyclonal antibodies, fragments and chimeras, maybe prepared 
using methods known to those skilled in the art. Isolated native or recombinant endometrial markers may be 
utilized to prepare antibodies. (See, for example, Kohler et al. (1975) Nature 256:495-497; Kozbor et al. (1985) J. 
Immunol Methods 81:31-42; Cote et al. (1983) Proc Natl Acad Sci 80:2026-2030; and Cole et al. (1984) Mol 
25 Cell Biol 62: 109-120 forthe preparation of monoclonal antibodies; Huse et al. (1989) Science 246: 1275- 1281 for 
the preparation of monoclonal Fab fragments; and. Pound (1998) Immunochemical Protocols, Humana Press, 
Totowa, N.J for the preparation of phagemid or B-lymphocyfce immunoglobulin libraries to identify antibodies). 
Antibodies specific for an endometrial marker may also be obtained from scientific or commercial sources. 

In an embodiment of the invention, antibodies are reactive against an endometrial marker if they bind 
30 with a Ka of greater than or equal to 10"^ M. 
Markers 

The invention provides a set of markers correlated with endometrium or phase thereof, or endometrial 
disease. A subset of these markers identified as usefiil for detection, diagnosis, prevention and therapy of 
endometrial disease is listed in Table 1 or 5. A subset of these markers identified as useful for detection and 
35 diagnosis of endometrium phase is listed in Table 4 or 6. The invention also provides a method of using these 
markers to distinguish endometrium phase or to distinguish endometrial disease. 

The invention provides marker sets that distinguish endometrium phase or endometrial disease and uses 
therefor. In an aspect, the invention provides a method for classifying an endometrium phase or endometrial 
disease comprising detecting a difference in the expression of a first plurality of endometrial markers or 
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endometrial polynucleotide markers relative to a control, the first plurality of endometrial markers or endometrial 
polynucleotide markers consisting of at least 5 of the markers listed in Table 1, 4, 5, or 6. In specific aspects, the 
plurality of markers consists of at least 10 of the markers listed in Table 1, 4, 5, or 6. In anotlier specific aspect, 
the control comprises markers derived from a pool of samples from individual patients with no endometrial 
5 disease, or individuals with a known endometrium phase. 

Any of the markers provided herein may be used alone or with other markers of endometrium phase or 
endometrial disease, or with markers for other phenotypes or conditions. 
Identification of Endometrial Markers 

The invention relates to a method for identifying markers associated with tlie endometrium or a phase 
10 thereof, or associated with a disease of the endometrium comprising: 

(a) obtaining a sample of endometrium from a subject; 

(b) extracting proteins from the sample and producing a profile of the proteins by subjecting the 
proteins to mass spectrometry; and 

(c) comparing the projRle with a profile for normal endometrial tissue or for a known 
15 endometrium phase to identify proteins associated with the endometrium phase or an 

endometrial disease. 

Proteins maybe extracted from the samples in a manner known in the art. For example, proteins maybe 
extracted by first digesting or disrupting cell membranes by standard methods such as detergents or 
homogenization in an isotonic sucrose solution, followed by ultra-centrifugation or other standard techniques. 

20 The separated proteins may be digested into peptides, in particular using proteolytic enzymes such as 

trypsin, pepsin, subtilisin, and proteinase. For example, proteins may be treated with trypsin which cleaves at the 
sites of lysine and arginine, to provide doubly-charged peptides with a length of from about 5 to 50 amino acids. 
Such peptides may be particularly appropriate for mass spectrometry analysis, especially electrospray ionization 
mass spectrometry. Chemical reagents including cyanogens bromide may also be utilized to digest proteins. 

25 Mass spectrometers that may be used to analyze the peptides or proteins include a Matrix- Assisted 

Laser Desorptioon/Ioniation Time-of-Flight Mass Spectrometer ("MALDI-TOF") (e.g. from PerSeptive 
Biosystems, Framingham, Mass.); an Electrospray Ionization ("ESI") ion trap spectrometer, (e.g. from Finnigan 
MAT, San Jose, Calif.), an ESI quadrupole mass spectrometer (e.g. from Finnigan or Perldn-Blmer Corporation, 
Foster City, Calif.), a quadrupole/TOF hybrid tandem mass spectrometer, QSTAR XL (Applied 

30 Biosystems/MDS Sciex), or a Surface Enhanced Laser Desorption/Ionization (SELDI-TOF) Mass Spectrometer 
(e.g. from Ciphergen Biosystems Inc.). 
Detection Methods 

A variety of methods can be employed for the diagnostic and prognostic evaluation of endometrial 
disease or endometrial status involving one or more endometrial markers and polynucleotides encoding the 
35 markers, and the identification of subjects with a predisposition to endometrial diseases or that are receptive to in 
vitro fertilization and embryo transfer procedures. Such methods may, for example, utilize endometrial 
polynucleotide markers, and fragments thereof, and binding agents (e.g. antibodies) against one or more 
endometrial markers, including peptide fragments. In particular, the polynucleotides and antibodies maybe used, 
for example, for (1) the detection of the presence of endometrial polynucleotide marker mutations, or the 
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detection of either over- or under-expression of endometrial marker mRNA relative to a non-disorder state or 
different endometrium phase, or the qualitative or quantitative detection of alternatively spliced forms of 
endometrial polynucleotide marker transcripts which may correlate with certain conditions or susceptibility 
toward such conditions; and (2) the detection of either an over- or an under-abundance of one or more 
5 endometrial markers relative to a non- disorder state or a different endometrium phase or the presence of a 
modified (e.g., less than fiill length) endometrial marker which correlates with a disorder state or a progression 
toward a disorder state, or a particular endometrium phase. 

The invention contemplates a method for detecting the phase of an endometrial tissue, in particular a 
secretory endometrial tissue, comprising producing a profile of levels of one or more endometrial marker 

1 0 associated with a known endometrium phase and/or polynucleotides encoding the markers, and optionally other 
markers associated with the endometrium phase in cells from a patient, and comparing the profile with a 
reference to identify a profile for the test cells indicative of the endometrium phase. In an aspect, the endometrial 
markers are one or more of glutamate receptor subunit zeta 1, macrophage migration inhibitory factor, FRATl, 
myosin light chain kinase 2, tropomyosin 1 alpha chain, and fragments thereof. 

15 The invention also contemplates a method for detecting an endometrial disease, in particular an 

endometrial cancer, comprising producing a profile of levels of one or more endometrial marker associated with 
an endometrial disease and/or polynucleotides encoding the markers, and other markers associated with 
endometrial disease in cells from a patient, and comparing the profile with a reference to identify a profile for the 
test cells indicative of disease. In an aspect, the endometrial markers are one or more of chaperonin 10, 

20 calgranulin A, calgranulin B, polymeric-immunoglobulin receptor (precursor), phosphatidylethanolamine- 
binding protein, acidic leucine-rich nuclear piiosphoprotein 32 family member A, heat shock 70 kDa protein 6, 
macrophage migration inhibitory factor, calgizzarin (SIOOC protein), triosephosphate isomerase, alpha- 1- 
antitrypsin precursor, creatine kinase, B chain, (B-CK), pyruvate. Ml or M2 isozyme, transgelin (smooth muscle 
protein 22-alpha), and heterologous nuclear ribonucleoprotein DO. 

25 The methods described herein may be used to evaluate the probability of the presence of malignant or 

pre-malignant cells, for example, in a group of cells freshly removed from a host. Such methods can be used to 
detect tumors, quantitate their growth, and help in the diagnosis and prognosis of endometrial disease. The 
methods can be used to detect the presence of cancer metastasis, as well as confirm the absence or removal of all 
tumor tissue following surgery, cancer chemotherapy, and/or radiation therapy. They can further be used to 

30 monitor cancer chemotherapy and tumor reappearance. 

The methods described herein can be adapted for diagnosing and monitoring endometrial tissue status or 
an endometrial disease by detecting one or more endometrial markers or polynucleotides encoding the markers in 
biological samples from a subject. These applications require that the amount of markers or polynucleotides 
quantitated in a sample from a subject being tested be compared to a predetermined standard or cut-off value. 

35 The standard may correspond to levels quantitated for another sample or an earlier sample from the subject, or 
levels quantitated for a control sample. Levels for control samples from healthy subjects, different endometrial 
tissue phases, or subjects with an endometrial disease may be established by prospective and/or retrospective 
statistical studies. Healthy subjects who have no clinically evident disease or abnormalities may be selected for 
statistical studies. Diagnosis may be made by a finding of statistically different levels of detected endometrial 
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markers associated with disease or polynucleotides encoding same, compared to a control sample or previous 
levels quantitated for the same subject. 

The methods described herein may also use multiple markers for an endometrial disease, in particular 
endometrial cancer. Therefore, the invention contemplates a method for anaylzing a biological sample for the 
5 presence of one ormore endometrial markers and polynucleotides encoding the markers, and other markers that 
are specific indicators of an endometrial disease. The methods described herein maybe modified by including 
reagents to detect the additional markers, or polynucleotides for the markers. 
Nucleic Acid Methods/Assays 

As noted herein an endometrial disease or phase may be detected based on the level of endometrial 
1 0 polynucleotide markers in a sample. Techniques for detecting polynucleotides such as polymerase chain reaction 
(PGR) and hybridization assays are well known in the art. 

Probes may be used in hybridization techniques to detect endometrial polynucleotide markers. The 
technique generally involves contacting and incubating nucleic acids (e.g. recombinant DNA molecules, cloned 
genes) obtained from a sample from a patient or other cellular source with a probe under conditions favorable for 
1 5 the specific annealing of the probes to complementary sequences in the nucleic acids. After incubation, the non- 
annealed nucleic acids are removed, and the presence of nucleic acids that have hybridized to the probe if any are 
detected. 

Nucleotide probes for use in the detection of nucleic acid sequences in samples may be constructed 
using conventional methods Icnown in the art. Suitable probes may be based on nucleic acid sequences encoding 

20 at least 5 sequential amino acids from regions of an endometrial polynucleotide marker, preferably they comprise 
10-200, more particularly 10-30, 10-40, 20-50, 40-80, 50-150, 80-120 nucleotides in length. 

The probes may comprise DNA or DNA mimics (e.g., derivatives and analogues) corresponding to a 
portion of an organism's genome, or complementary RNA or RNA mimics. Mimics are polymers comprising 
subunits capable of specific, Watson-Crick-like hybridization with DNA, or of specific hybridization with RNA. 

25 The nucleic acids can be modified at the base moiety, at the sugar moiety, or at the phosphate backbone, 

DNA can be obtained using standard methods such as polymerase chain reaction (PGR) amplification of 
genomic DNA or cloned sequences. (See, for example, in Innis et al., eds., 1990, PGR Protocols: A Guide to 
Methods and Applications, Academic Press Inc., San Diego, Calif.). Computer programs known in the art can be 
used to design primers with the required specificity and optimal amplification properties, such as Oligo version 

30 5.0 (National Biosciences). Controlled robotic systems may be useful for isolating and amplifying nucleic acids. 

A nucleotide probe may be labeled with a detectable substance such as a radioactive label that provides 
for an adequate signal and has sufficient half-life such as ^^P, ^H, ^*C or the like. Other detectable substances that 
may be used include antigens that are recognized by a specific labeled antibody, fluorescent compounds, 
enzymes, antibodies specific for a labeled antigen, and luminescent compounds. An appropriate label may be 

35 selected having regard to the rate of hybridization and binding of the probe to the nucleotide to be detected and 
the amount of nucleotide available for hybridization. Labeled probes may be hybridized to nucleic acids on solid 
supports such as nitrocellulose filters or nylon membranes as generally described in Sambrook et al, 1989, 
Molecular Cloning, A Laboratory Manual (2nd ed.). The nucleic acid probes maybe used to detect endometrial 
polynucleotide markers, preferably in human cells. The nucleotide probes may also be useful in the diagnosis of 
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an endometrial disease involving one or more endometrial polynucleotide markers, in monitoring the progression 
of such disorder, or monitoring a therapeutic treatment. 

The detection of endometrial polynucleotide markers may involve the amplification of specific gene 
sequences using an amplification method such as polymerase chain reaction (PCR), followed by the analysis of 
5 the amplified molecules using techniques known to those skilled in the art. Suitable primers can be routinely 
designed by one of skill in the art. 

By way of example, at least two oligonucleotide primers may be employed in a PCR based assay to 
amplify a portion of a polynculeotide encoding one or more endometrial marker derived fi*om a sample, wherein 
at least one of the oligonucleotide primers is specific for (i.e. hybridizes to) a polynucleotide encoding the 
1 0 endometrial marker. The amplified cDN A is then separated and detected using techniques well known in the art, 
such as gel electrophoresis. 

In order to maximize hybridization under assay conditions, primers and probes employed in the methods 
of the invention generally have at least about 60%, preferably at least about 75%, and more preferably at least 
. about 90% identity to a portion of a polynucleotide encoding an endometrial marker; that is, they are at least 1 0 
1 5 nucleotides, and preferably at least 20 nucleotides in length. In an embodiment the primers and probes are at least 
about 10-40 nucleotides in length. 

Hybridization and amplification techniques described herein may be used to assay qualitative and 
quantitative aspects of endometrial polynucleotide marker expression. For example, RNA maybe isolated from a 
cell type or tissue known to express an endometrial polynucleotide marker and tested utilizing the hybridization 
20 (e.g. standard Northern analyses) or PCR techniques referred to herein. 

The primers and probes may be used in the above-described methods in situ i.e directly on tissue 
sections (fixed and/or frozen) of patient tissue obtained from biopsies or resections. 

In an aspect of the invention, a method is provided employing reverse transcriptase-polymerase chain 
reaction (RT-PCR), in which PCR is applied in combination with reverse transcription. Generally, RNA is 
25 extracted fiom a sample tissue using standard techinques (for example, guanidine isothiocyanate extraction as 
described by Chomcynski and Sacchi, Anal. Biochem. 162:156-159, 1987) and is reverse transcribed to produce 
cDNA. The cDNA is used as a template for a polymerase chain reaction. The cDNA is hybridized to a set of 
primers, at least one of which is specifically designed against an endometrial marker sequence. Once the pimer 
and template have annealed a DNA polymerase is employed to extend from the primer, to synthesize a copy of 
30 the template. The DNA strands are denatured, and the procedure is repeated many times until sufficient DNA is 
generated to allow visualization by ethidium bromide staining and agarose gel electrophoresis. 

Amplification may be performed on samples obtained from a subject with a suspected endometrial 
disease and an individual who is not afflicted with an endometrial disease. The reaction may be performed on 
several dilutions of cDNA spanning at least two orders of magnitude. A statistically significant difference in 
35 expression in several dilutions of the subject sample as compared to the same dilutions of the non-disease sample 
may be considered positive for the presence of an endometrial disease. 

In an embodiment, the invention provides methods for determining the presence or absence of an 
endometrial disease in a subject comprising (a) contacting a sample obtained firom the subject with 
oligonucleotides that hybridize to endometrial polynucleotide markers; and (b) detecting in the sample a level of 
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nucleic acids that hybridize to the polynucleotides relative to a predetermined cut-off value, and therefrom 
determining the presence or absence of an endometrial disease in the subject. In an aspect, the endometrial 
disease is cancer and the endometrial markers are one or more of chaperonin 10, calgranulin A, calgranulin B, 
polymeric-immunoglobulin receptor (precursor), phosphatidylethanolamine-binding protein, acidic leucine-rich 
5 nuclear phosphoprotein 32 family member A, heat shock 70 kDa protein 6, macrophage migration inhibitory 
factor, calgizzarin (S lOOC protein), triosephosphate isomerase, alpha- 1 -antitrypsin precursor, creatine kinase, B 
chain, (B-CK), pyruvate. Ml or M2 isozyme, transgelin (smooth muscle protein 22-alpha), and heterologous 
nuclear ribonucleoprotein DO. 

In another embodiment, the invention provides methods for determining uterine receptivity of a subject 

10 to in vitro fertilization comprising' (a) contacting a sample obtained from the subject with oligonucleotides that 
hybridize to endometrial polynucleotide markers associated with an endometrial tissue phase (e.g. secretory 
phase); and (b) detecting in the sample a level of nucleic acids that hybridize to the polynucleotides relative to a 
predetermined cut-off value, wherein the presence or absence of the endometrial marker polynucleotides as 
compared to non-receptive controls indicates uterine receptivity. . In an aspect, the endometrial markers are one 

15 or more of glutamate receptor subunit zeta 1, macrophage migration inhibitory factor, FRATl, myosin light 
chain kinase 2, tropomyosin 1 alpha chain, and fragments thereof 

The invention provides a method wherein an endometrial marker mRNA is detected by (a) isolating 
mRNA from a sample and combining the mRNA with reagents to convert it to cDNA; (b) treating the converted 
cDNA with amplification reaction reagents and nucleic acid primers that hybridize to one or more endometrial 

20 marker polynucleotides, to produce amplification products; (d) analyzing the amplification products to detect 
amounts of mRNA encoding endometrial polynucleotide markers; and (e) comparing the amount of mRNA to 
an amount detected against a panel of expected values for normal and malignant tissue derived using similar 
nucleic acid primers. 

Endometrial cancer marker-positive samples or alternatively higher levels in patients compared to a 
25 control (e.g. non-cancerous tissue) may be indicative of late stage disease, and/or that the patient is not 
responsive to chemotherapy. Alternatively, negative samples or lower levels compared to a control (e.g. non- 
cancerous tissue or negative samples) may also be indicative of progressive disease and shorter overall survival. 

In another embodiment, the invention provides methods for determining the presence or absence of 
endometrial cancer in a subject comprising (a) contacting a sample obtained from the subject with 
30 oligonucleotides that hybridize to one or more endometrial cancer polynucleotide markers; and (b) detecting in 
the sample levels of nucleic acids that hybridize to the polynucleotides relative to a predetermined cut-off value, 
and therefrom determining the presence or absence of endometrial cancer in the subject. In an embodiment, the 
endometrial cancer polynucleotide markers encode one or more polypeptides listed in Table 1. In particular, the 
endometrial markers are one or more of chaperonin 10, calgranulin A, calgranulin B, polymeric-immimoglobulin 
35 receptor (precursor), phosphatidylethanolamine-binding protein, acidic leucine-rich nuclear phosphoprotein 32 
family member A, heat shock 70 kDa protein 6, macrophage migration inhibitory factor, calgizzarin (SIOOC 
protein), triosephosphate isomerase, alpha- 1 -antitrypsin precursor, creatine kinase, B chain, (B-CK), pyruvate, 
Ml or M2 isozyme, transgelin (smooth muscle protein 22-alpha), and heterologous nuclear ribonucleoprotein 
DO, and fragments thereof. 
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In particular, the invention provides a method wherein chaperonin 10 mRNA is detected by (a) isolating 
mRNA from a sample and combining the mRNA with reagents to convert it to cDNA; (b) treating the converted 
cDNA with amplification reaction reagents and nucleic acid primers that hybridize to a polynucleotide encoding 
chaperonin 10, to produce amplification products; (d) analyzing the amplification products to detect an amount 
5 of mRNA encoding chaperonin 10; and (e) comparing the amount of mRNA to an amoimt detected against a 
panel of expected values for normal and malignant tissue derived using similar nucleic acid primers. 

Endometrial cancer marker-positive samples or alternatively higher levels, in particular significantly 
higher levels of chaperonin 10 polynucleotides in patients compared to a control (e.g. normal or benign) are 
indicative of endometrial cancer. Negative samples or lower levels compared to a control (e.g. normal or benign) 

10 may also be indicative of progressive disease and poor overall survival. 

Oligonucleotides or longer fragments derived from an endometrial cancer polynucleotide marker may 
be used as targets in a microarray. The microarray can be used to simultaneously monitor the expression levels of 
large numbers of genes and to identify genetic variants, mutations, and polymorphisms. The information from 
the microarray may be used to determine gene function, to understand the genetic basis of a disorder, to diagnose 

15 a disorder, and to. develop and monitor the activities of therapeutic agents. 

The preparation, use, and analysis of microarrays are well known to a person skilled in the art. (See, for 
example, Brennan, T. M. et al. (1995) U.S. Pat. No, 5,474,796; Schena, et al. (1996) Proc. Natl. Acad. Sci. 
93:10614-10619; Baldeschweiler et al. (1995), PCT Application W095/251116; Shalon, D, et al. (I 995) PCT 
application WO95/35505; Heller, R. A. et al. (1997) Proc. Natl. Acad- Sci. 94:2150-2155; and Heller, M. J. et al. 

20 (1997) U.S. Pat. No. 5,605,662.) 

Thus, the invention also includes an array comprising one or more endometrial polynucleotide markers 
(in particular the markers listed in Table 1, 4, 5, or 6) and optionally other markers. The array can be used to 
assay expression of endometrial polynucleotide markers in the array. The invention allows the quantitation of 
expression of one or more endometrial polynucleotide markers. 

25 Microarrays typically contain at separate sites nanomolar quantities of individual genes, cDNAs, or 

ESTs on a substrate (e.g.nitrocellulose or silicon plate), or photolithographically prepared glass substrate. The 
arrays are hybridized to cDNA probes using conventional techniques with gene-specific primer mixes. The target 
polynucleotides to be analyzed are isolated, amplified and labeled, typically with fluorescent labels, radiolabels . 
or phosphorous label probes. After hybridization is completed, the array is inserted into the scanner, where 

30 patterns of hybridization are detected. Data are collected as light emitted from the labels incorporated into the 
target, which becomes bound to the probe array. Probes that completely match the target generally produce 
stronger signals than those that have mismatches. The sequence and position of each probe on the array are 
known, and thus by complementarity, the identity of the target nucleic acid appUed to the probe array can be 
determined. 

35 Microarrays are prepared by selecting polynucleotide probes and immobilizing them to a solid support 

or surface. The probes may comprise DNA sequences, RNA sequences, copolymer sequences of DNA and RNA, 
DNA and/or RNA analogues, or combinations thereof. The probe sequences may be full or partial fragments of 
genomic DNA, or they may be synthetic oligonucleotide sequences synthesized either enzymatically in vivo, 
enzymatically z« vitro (e.g., by PGR), or non-enzymatically in vitro. 
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The probe or probes used in the methods of the invention can be immobilized to a solid support or 
surface wliich may be either porous or non-porous. For example, the probes can be attached to a nitrocellulose or 
nylon membrane or filter covalently at either the 3' or the 5' end of the polynucleotide probe. The solid support 
may be a glass or plastic surface. In an aspect of the invention, hybridization levels are measured to microarrays 
5 of probes consisting of a solid support on the surface of which are immobilized a population of polynucleotides, 
such as a population of DNA or DNA mimics, or, alternatively, a population of RNA or RNA mimics. A solid 
support may be a nonporous or, optionally, a porous material such as a gel. 

In accordance with embodiments of the invention, a microarray is provided comprising a support or 
surface with an ordered array of hybridization sites or "probes" each representing one of the markers described 
1 0 herein. The microarrays can be addressable arrays, and in particular positionally addressable arrays. Each probe 
of the array is typically located at a known, predetermined position on the solid support such that the identity of 
each probe can be determined from its position in the array. In preferred embodiments, each probe is covalently 
attached to the solid support at a single site. 

Microarrays used in the present invention are preferably (a) reproducible, allowing multiple copies of a 
15 given array to be produced and easily compared with each other; (b) made from materials that are stable under 
hybridization conditions; (c) small, (e.g., between 1 cm^ and 25 cm^, between 12 cm^ and 13 cm^, or 3 cm^; and 
(d) comprise a unique set of binding sites that will specifically hybridize to the product of a single gene in a cell 
(e.g., to a specific mRNA, or to a specific cDNA derived therefrom). However, it will be appreciated that larger 
arrays may be used particularly in screening arrays, and other related or similar sequences will cross hybridize to 
20 a given binding site. 

In accordance with an aspect of the invention, the microarray is an array in which each position 
represents one of the markers described herein. Each position of the array can comprise a DNA or DNA analogue 
based on genomic DNA to which a particular RNA or cDNA transcribed from a genetic marker can specifically 
hybridize. A DNA or DNA analogue can be a synthetic oUgomer or a gene fragment; In an embodiment, probes 
25 representing each of the endometrial markers and endometrial polynucleotide markers is present on the array. In 
a preferred embodiment, the array comprises at least 5 of the endometrial polynucleotide markers. 

Probes for the microarray can be synthesized using N-phosphonate or pliosphoramidite chemistries 
(Froehler et al., 1986, Nucleic Acid Res. 14:5399-5407; McBride et al., 1983, Tetrahedron Lett. 24:246-248). 
Synthetic sequences are typically between about 10 and about 500 bases, 20-100 bases, or 40-70 bases in length. 
30 Synthetic nucleic acid probes can include non-natural bases, such as, without limitation, inosine. Nucleic acid 
analogues such as peptide nucleic acid may be used as binding sites for hybridization, (see, e.g., Egholm et al., 
1993, Nature 363:566-568; U.S. Pat. No. 5,539,083). 

Probes can be selected using an algorithm that takes into account binding energies, base composition, 
sequence complexity, cross-hybridization binding energies, and secondary structure (see Friend et al., 
35 International Patent Publication WO 01/05935, published Jan. 25, 2001). 

Positive control probes, (e.g., probes known to be complementary and hybridizae to sequences in the 
target polynucleotides), and negative control probes, (e.g., probes known to not be complementary and hybridize 
to sequences in the target polynucleotides) are typically included on the array. Positive controls can be 
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synthesized along the perimeter of the array or. synthesized in diagonal stripes across the array. A reverse 
complement for each probe can be next to the position of the probe to serve as a negative control. 

The probes can be attached to a solid support or surface, which may be made from glass, plastic (e.g., 
polypropylene, nylon), polyacrylamide, nitrocellulose, gel, or other porous or nonporous material. The probes 
5 can be printed on surfaces such as glass plates (see Schena et al., 1995, Science 270:467-470). This method may 
be particularly useful for preparing microarrays of cDNA (See also, DeRisi et al., 1996, Nature Genetics 14:457- 
460; Shalon et al,, 1996, Genome Res. 6:639-645; and Schena et al., 1995, Proc. Natl. Acad. Sci. U.S.A. 
93:10539-11286). 

High-density oligonucleotide arrays containing thousands of oligonucleotides complementary to defined 

10 sequences, at defined locations on a surface can be produced using photolithographic techniques for synthesis in 
situ (see, Fodor et al., 1991, Science 251:767-773; Pease et al, 1994, Proc. Natl. Acad. Sci. U.S.A. 91:5022- 
5026; Lockhart et al., 1996, Nature Biotechnology 14: 1675; U.S. Pat Nos. 5,578,832; 5,556,752; and 5,5 10,270) 
or other methods for rapid synthesis and deposition of defined oligonucleotides (Blanchard et al.. Biosensors & 
Bioelectronics 11:687-690). Using these methods oligonucleotides (e.g., 60-mers) of known sequence are 

15 synthesized directly on a surface such as a derivatized glass slide. The array produced may be redundant, with 
several oligonucleotide molecules per RNA. 

Microarrays can be made by other methods including masking (Maskos and Southern, 1992, Nuc. 
Acids. Res. 20: 1 679- 1684). In an embodiment, microarrays of the present invention are preduced by synthesizing 
polynucleotide probes on a support wherein the nucleotide probes are attached to the support covalently at either 

20 the 3' or the 5' end of the polynucleotide. 

The invention provides microarrays comprising a disclosed marker set. In one embodiment, the 
invention provides a microarray for distinguishing endometrial disease samples comprising a positionally- 
addressable array of polynucleotide probes bound to a support, the polynucleotide probes comprising a plurality 
of polynucleotide probes of different nucleotide sequences, each of the different nucleotide sequences comprising 

25 a sequence complementary and hybridizable to a plurality of genes, the plurality consisting of at least 2, 3, 4, 5, 
6, 7, 8, 9, 10, 1 1, 12, 13, 14, or 15 of the genes corresponding to the markers listed in Table 1, 4, 5, or 6. An 
aspect of the invention provides microarrays comprising at least 5, 10, or 15 of the polynucleotides encoding the 
markers listed in Table 1, 4, 5, or 6. 

The invention provides gene marker sets that distinguish endometrium phase or endometrial disease and 

30 uses therefor. In an aspect, the invention provides a metliod for classifying an endometrium phase or disease 
comprising detecting a difference in the expression of a first plurality of genes relative to a control, the first 
plurality of genes consisting of at least 5 of the genes encoding the markers listed in Table 1, 4, 5, or 6. In 
specific aspects, the plurality of genes consists of at least 10 or 15 of the genes encoding the markers listed in 
Table 1, 4, 5, or 6. In another specific aspect, the control comprises nucleic acids derived from a pool of samples 

35 from individual control patients. 

The invention provides a method for classifying an endometrium phase or endometrial disease by 
calculating the similarity between the expression of at least 5 polynucleotides encoding markers listed in Table 1, 
4, 5, or 6 in a sample to the expression of the same markers in a control pool, comprising the steps of: 
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(a) labeling nucleic acids derived from a sample, with a first fluorophore to obtain a first pool of 
fluorophore-labeled nucleic acids; 

(b) labeling with a second fluorophore a first pool of nucleic acids derived from two or more 
endometrial disease samples, and a second pool of nucleic acids derived from two or more 

5 control samples; 

(c) contacting the first fluorophore-labeled nucleic acid and the first pool of second fluorophore- 
labeled nucleic acid with a first microarray under conditions such that hybridization can occur, 
and contacting the first fluorophore-labeled nucleic acid and the second pool of second 
fluorophore-labeled nucleic acid with a second microarray under conditions such that 

1 0 hybridization can occur, detecting at each of a plurality of discrete loci on the first microarray 

a first flourescent emission signal from the first fluorophore-labeled nucleic acid and a second 
fluorescent emission signal from the first pool of second fluorophore-labeled genetic matter 
that is bound to the first microarray and detecting at each of the marker loci on the second 
microarray the first fluorescent emission signal from the first fluorophore-labeled nucleic acid 

15 and a third fluorescent emission signal from the second pool of second fluorophore-labeled 

nucleic acid; 

(d) determining the similarity of the sample to patient and contol pools by comparing the first 
fluorescence emission signals and the second fluorescence emission signals, and the first 
emission signals and the third fluorescence emission signals; and 

20 (e) classifying the sample as endometrial disease where the first fluorescence emission signals are 

more similar to the second fluorescence emission signals than to the third fluorescent emission 
signals, and classifying the sample as non-endometrial disease where the first fluorescence 
emission signals are more similar to the third fluorescence emission signals than to the second 
fluorescent emission signals, wherein the first microarray and the second microarray are 
25 similar to each other, exact replicas of each other, or are identical, and wherein the similarity 

is defined by a statistical method such that the cell sample and control are similar where the p 
value of the similarity is less than 0.01. 
In aspects of the invention, the array can be used to monitor the time course of expression of one or 
more endometrial poljoiucleotide markers in the array. This can occur in various biological contexts such as 
30 tumor progression. 

The array is also useful for ascertaining differential expression patterns of endometrial polynucleotide 
markers, and optionally other markers, in normal and abnormal cells. This may provide a battery of nucleic acids 
that could serve as molecular targets for diagnosis or therapeutic intervention. 
Protein Methods 

35 Binding agents may be used for a variety of diagnostic and assay applications. There are a variety of 

assay formats known to the skilled artisan for using a binding agent to detect a target molecule in a sample. (For 
example, see Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988). In 
general , the presence or absence of an endometrial disease (e.g. cancer) or an endometrium phase in a subject 
may be determined by (a) contacting a sample from the subject with a binding agent; (b) detecting in the sample 
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a level of polypeptide that binds to the binding agent; and (c) comparing the level of polypeptide with a 
predetermined standard or cut-ofF value. 

In particular embodiments of the invention, the binding agent is an antibody. Antibodies specifically 
reactive with one or more endometrial marker, or derivatives, such as enzyme conjugates or labeled derivatives, 

5 may be used to detect one or more endometrial marker in various samples (e.g. biological materials). They may 
be used as diagnostic or prognostic reagents and they may be used to detect abnormalities in the level of 
expression of one or more endometrial marker, or abnormalities in the structure, and/or temporal, tissue, cellular, 
or subcellular location of one or more endometrial marker. Antibodies may also be used to screen potentially 
therapeutic compounds in vitro to determine their effects on disorders (e.g. endometrial cancer) involving one or 

10 more endometrial markers, and other conditions. In vitro immunoassays may also be used to assess or monitor 
the efficacy of particular therapies. 

In an aspect, the invention provides a method for monitoring or diagnosing an endometrial disease (e.g. 
cancer) in a subject by quantitating one or more endometrial markers in a biological sample from the subject 
comprising reacting the sample with antibodies specific for one or more endometrial markers, which are directly 

15 or indirectly labeled with detectable substances and detecting the detectable substances. In a particular 
embodiment of the invention, endometrial markers are quantitated or measured. 

In an aspect of the invention, a method for detecting an endometrial disease (e.g. cancer) is provided 
comprising: 

(a) obtaining a sample suspected of containing one or more endometrial markers associated with 
20 an endometrial disease; 

(b) contacting said sample with antibodies that specifically bind to the endometrial markers under 
conditions effective to bind the antibodies and form complexes; 

(c) measuring the amount of endometrial markers present in the sample by quantitating the 
amount of the complexes; and 

25 (d) comparing the amount of endometrial markers present in the samples with the amount of 

endometrial markers in a control, wherein a change or significant difference in the amount of 
endometrial markers in the sample compared with the amount in the control is indicative of an 
endometrial disease. 

In an embodiment, the invention contemplates a method for monitoring the progression of an 
30 endometrial disease (e.g. cancer) in an individual, comprising: 

(a) contacting antibodies which bind to one or more endometrial markers with a sample from the 
individual so as to form complexes comprising the antibodies and one or more endometrial 
markers in the sample; 

(b) determining or detecting the presence or amount of complex formation in the sample; 
35 (c) repeating steps (a) and (b) at a point later in time; and 

(d) comparing the result of step (b) with the result of step (c), wherein a difference in the amount 
of complex formation is indicative of disease, disease stage, and/or progression of the disease 
in said individual. 
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Tlie amount of complexes may also be compared to a value representative of the amount of the 
complexes from an individual not at risk of, or afflicted v^ith, an endometrial disease at different stages. A 
significant difference in complex formation maybe indicative of advanced disease e.g. advanced endometrial 
cancer, or an unfavourable prognosis. 
5 In aspects of the invention for diagnosis and monitoring of endometrial cancer, the endometrial markers 

are one or more of chaperonin 10, calgranulin A, calgranulinB,polymeric-immunoglobulin receptor (precursor), 
phosphatidylethanolamine-binding protein, acidic leucine-rich nuclear phosphoprotein 32 family member A, heat 
shock 70 kDa protem 6, macrophage migration inhibitory factor, calgizzarin (SIOOC protein), triosephosphate 
isomerase, alpha- 1 -antitrypsin precursor, creatine kinase, B chain, (B-CK), pyruvate. Ml or M2 isozyme, 
10 transgelin (smooth muscle protein 22-alpha), and heterologous nuclear ribonucleoprotein DO, or fragments 
thereof. 

hi embodiments of the methods of the invention, chaperonin 10 is detected in samples andhiglier levels, 
in particular significantly higher levels compared to a control (normal or benign) is indicative of endometrial 
cancer. 

15 In aspects of the invention for characterizing endometrium phase tlie endometrial markers are one or 

more of glutamate receptor subunit zeta 1, macrophage migration inhibitory factor, FRATl, myosin light chain 
kinase 2, tropomyosin 1 alpha chain, and fragments thereof. 

In another embodiment, the invention provides methods for determining uterine receptivity of a subject 
to in vitro fertilization comprising (a) contacting a sample obtained from the subject with antibodies that bind to 

20 one or more endometrial marker associated with a certain endometrium phase (e.g. secretory phase); and (b) 
detecting in the sample a level of endometrial marker relative to a predetermined cut-off value, wherein the 
presence or absence of the endometrial marker as compared to non-receptive controls indicates uterine 
receptivity. In a particular embodiment, the marker is one or more of glutamate receptor subunit zeta 1, 
macrophage migration inhibitory factor, FJRATl, myosin light chain kinase 2, tropomyosin 1 alpha chain, and 

25 fragments thereof, more particularly glutamate receptor subunit zeta 1 or a fragment thereof, and/or macrophage 
migration inhibitory factor. 

Antibodies may be used in any known immunoassays that rely on the binding interaction between 
antigenic determinants of one or more endometrial marker and the antibodies. Immunoassay procedures for in 
vitro detection of antigens in fluid samples are also well Icnown in the art. [See for example, Paterson et al., Int. J. 

30 Can. 37:659 (1986) and Burchell et al., hit. J. Can. 34:763 (1984) for a general description of immunoassay 
procedures]. Qualitative and/or quantitative determinations of one or more endometrial marker in a sample may 
be accompHshed by competitive or non-competitive immunoassay procedures in either a direct or indirect 
format. Detection of one or more endometrial marker using antibodies can be done utilizing immunoassays 
which are run in either the forward, reverse or simultaneous modes. Examples of immunoassays are 

35 radioimmunoassays (RIA), enzyme immunoassays (e.g. ELISA), immunofluorescence, immunoprecipitation, 
latex agglutination, hemagglutination, histochemical tests, and sandwich (immunometric) assays. These terms are 
well understood by those skilled in the art. A person skilled in the art will know, or can readily discern, other 
immunoassay formats without undue experimentation. 

According to an embodiment of the invention, an immunoassay for detecting one or more endometrial 
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markers in a biological sample comprises contacting binding agents that specifically bind to endometrial markers 
in the sample under conditions that allow the formation of first complexes comprising a binding agent and 
endometrial markers and determining the presence or amount of the complexes as a measure of the amount of 
endometiial markers contained in the sample. In a particular embodiment, the binding agents are labeled 
5 differently or are capable of binding to different labels. 
' Antibodies may be used to detect and quantify one or more endometrial markers in a sample in order to 

diagnose and treat pathological states. In particular, the antibodies may be used in immunohistochemical 
analyses, for example, at the cellular and sub-subcellular level, to detect one or more endometrial markers, to 
localize them to particular endometrial cells and tissues (e.g. tumor cells and tissues), and to specific subcellular 

10 locations, and to quantitate the level of expression. 

Immunohistochemical methods for the detection of antigens in tissue samples are well known in the art. 
For example, immunohistochemical methods are described in Taylor, Arch, Pathol. Lab. Med. 102: 112 (1978). 
Briefly, in the context of the present invention, a tissue sample obtained from a subject suspected of having an 
endometrial-related problem is contacted with antibodies, preferably monoclonal antibodies recognizing one or 

15 more endometrial markers. The site at which the antibodies are bound is determined by selective staining of the 
sample by standard immunohistochemical procedures. The same procedure may be repeated on the same sample 
using other antibodies that recognize one or more endometrial markers. Alternatively, a sample may be contacted 
with antibodies against one or more endometrial markers simultaneously, provided that the antibodies are labeled 
differently or are able to bind to a different label. The tissue sample may be normal endometrial tissue, a cancer 

20 tissue or a benign tissue. 

An antibody microarray in which binding sites comprise immobilized, preferably monoclonal, 
antibodies specific to a substantial fraction of marker- derived endometrial markers of interest can be utilized in 
the present invention. Antibody arrays can be prepared using methods known in the art [(see for example, Zhu et 
al., Science 293:2101 (2001) and reference 20]. 

25 Antibodies specific for one or more endometrial marker may be labelled with a detectable substance and 

localised in biological samples based upon the presence of the detectable substance. Examples of detectable 
substances include, but are not limited to, the following: radioisotopes (e.g., ^H, ^"^C, ^^S, ^^^I, ^^^I), fluorescent 
labels (e.g., FITC, rhodamine, lanthanide phosphors), luminescent labels such as luminol; enzymatic labels (e.g., 
horseradish peroxidase, beta-galactosidase, luciferase, alkaline phosphatase, acetylcholinesterase), biotinyl 

30 groups (which can be detected by marked avidin e.g., streptavidin containing a fluorescent marker or enzymatic 
activity that can be detected by optical or colorimetric methods), predetermined polypeptide epitopes recognized 
by a secondary reporter (e.g., leucine zipper pair sequences, binding sites for secondary antibodies, metal binding 
domains, epitope tags). In some embodiments, labels are attached via spacer arms of various lengths to reduce 
potential steric hindrance. Antibodies may also be coupled to electron dense substances, such as ferritin or 

35 colloidal gold, which are readily visualised by electron microscopy. 

One of the ways an antibody can be detectably labeled is to link it directly to an enzyme. The enzyme 
when later exposed to its substrate will produce a product that can be detected. Examples of detectable 
substances that are enzymes are horseradish peroxidase, beta-galactosidase, luciferase, alkaline phosphatase, 
acetylcholinesterase, malate dehydrogenase, ribonuclease, urease, catalase, glucose-6-phosphate, staphylococcal 
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nuclease, delta-5-steriod isomerase, yeast alcohol dehydrogenase, alpha-glycerophosphate, triose phosphate 
isomerase, asparaginase, glucose oxidase, and acetylcholine esterase. 

For increased sensitivity in an immunoassay system a fluorescence-emitting metal atom such as Eu 
(europium) and other lanthanides can be used. These can be attached to the desired molecule by means of metal- 

5 chelating groups such as DTPA or EDTA. 

A bioluminescent compound may also be used as a detectable substance, Bioluminescence is a type of 
chemiluminescence found in biological systems where a catalytic protein increases the efficiency of the 
chemiluminescent reaction. The presence of a bioluminescent molecule is determined by detecting the presence 
of luminescence. Examples of bioluminescent detectable substances are luciferin, luciferase and aequorin. 

1 0 Indirect methods may also be employed in which the primary antigen-antibody reaction is amplified by 

the introduction of a second antibody, having specificity for the antibody reactive against one or more 
endometrial markers. By way of example, if the antibody having specificity against one or more endometrial 
markers is a rabbit IgG antibody, the second antibody may be goat anti-rabbit gamma-globulin labelled with a 
detectable substance as described herein. 

15 Methods for conjugating or labelling the antibodies discussed above may be readily accomplished by 

one of ordinary skill in the art. (See for example Inman, Methods In Enzymology, Vol. 34, Affinity Techniques, 
Enzyme Purification: Part B, Jakoby and Wichek (eds.), Academic Press, New York, p. 30, 1974; and Wilchek 
and Bayer, "The Avidin-Biotin Complex in Bioanalytical AppUcations,"Anal. Biochem. 171:1-32, 1988 re 
methods for conjugating or labelling the antibodies with enzyme or ligand binding partner). 

20 Cytochemical techniques known in the art for localizing antigens using light and electron microscopy 

may be used to detect one or more endometrial markers. Generally, antibodies may be labeled with detectable 
substances and one or more endometrial markers may be localised in tissues and cells based upon the presence of 
the detectable substances. 

hi the context of the methods of the invention, the sample, binding agents (e.g. antibodies specific for 

25 one or more endometrial markers)^ or one or more endometrial markers may be immobilized on a carrier or 
support. Examples of suitable carriers or supports are agarose, cellulose, nitrocellulose, dextran, Sephadex, 
Sepharose, liposomes, carboxjonethyl cellulose, polyacrylamides, polystyrene, gabbros, filter paper, magnetite, 
ion-exchange resin, plastic film, plastic tube, glass, polyamine-methyl vinyl-ether-maleic acid copolymer, amino 
acid copolymer, ethylene-maleic acid copolymer, nylon, silk, etc. The support material may have any possible 

30 configuration including spherical (e.g. bead), cylindrical (e.g. inside surface of a test tube or well, or the external 
surface of a rod), or flat (e.g. sheet, test strip). Thus, the carrier may be in the shape of, for example, a tube, test 
plate, well, beads, disc, sphere, etc. The immobilized antibody may be prepared by reacting the material with a 
suitable insoluble carrier using known chemical or physical methods, for example, cyanogen bromide coupling. 
An antibody may be indirectly immobiUzed using a second antibody specific for the antibody. For example, 

35 mouse antibody specific for an endometrial marker may be immobilized using sheep anti-mouse IgG Fc fragment 
specific antibody coated on the carrier or support. 

Where a radioactive label is used as a detectable substance, one or more endometrial marker may be 
localized by radioautography. The results of radioautography may be quantitated by determining the density of 
particles in the radioautographs by various optical methods, or by counting the grains. 
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Time-resolved fluorometry may be used to detect a signal. For example, the method described in 
Christopoulos TK and Diamandis BP Anal Chem 1992:64:342-346 may be used with a conventional time- 
resolved fluorometer. 

In accordance with an embodiment of the invention, a method is provided wherein one or more 
5 endometrial marker antibodies are directly or indirectly labelled with enzymes, substrates for the enzymes are 
added wherein the substrates are selected so that the substrates, or a reaction product of an enzyme and substrate, 
form fluorescent complexes with a lanthanide metal (e.g. europium, terbium, samarium, and dysprosium, 
preferably europium and terbium). A lanthanide metal is added and dne or more endometrial cancer markers are 
quantitated in the sample by measuring fluorescence of the fluorescent complexes. Enzymes are selected based 

10 on the ability of a substrate of the enzyme, or a reaction product of the enzyme and substrate, to complex with 
lanthanide metals such as europium and terbium. Sxiitable enzymes and substrates that provide fluorescent 
complexes are described in U.S. Patent No. 5,3112,922 to Diamandis. Examples of suitable enzymes include 
alkaline phosphatase and P-galactosidase. Preferably, the enzyme is alkaline phosphatase. 

Examples of enzymes and substrates for enzymes that provide such fluorescent complexes are described 

15 in U.S. Patent No. 5,312,922 to Diamandis. By way of example, when the antibody is directly or indirectly 
labelled with alkaline phosphatase the substrate employed in the method may be 4-methylumbelliferyl 
phosphate, 5-fluorosalicyl phosphate, or diflunisal phosphate. The fluorescence intensity of the complexes is 
typically measured using a time-resolved fluorometer e.g. a CyberFluor 615 Imunoanalyzer (Nordion 
International, ICanata, Ontario). 

20 One or more endometrial marker antibodies may also be indirectly labelled with an enzyme. For 

example, the antibodies may be conjugated to one partner of a ligand binding pair, and the enzyme may be 
coupled to the other partner of the ligand binding pair. Representative examples include avidin-biotin, and 
riboflavin-riboflavin binding protein. In an embodiment, the antibodies are biotinylated; and the enzyme is 
coupled to streptavidin. In another embodiment, an antibody specific for endometrial marker antibody is labeled 

25 with an enzyme. 

In accordance with an embodiment, the present invention provides means for determining one or more 
endometrial markers in a sample by measuring one or more endometrial markers by immunoassay. It will be 
evident to a skilled artisan that a variety of immunoassay methods can be used to measure one or more 
endometrial markers. In general, an immunoassay method may be competitive or noncompetitive . Competitive 

30 methods typically employ an immobilized or immobilizable antibody to one or more endometrial marker and a 
labeled form of one or more endometrial marker. Sample endometrial markers and labeled endometrial markers 
compete for binding to antibodies to endometrial markers. After separation of the resulting labeled endometrial 
markers that have become bound to antibodies (bound fraction) from that which has remained unbound (unbound 
fraction), the amount of the label in either bound or unbound fraction is measured and maybe correlated witli tlie 

35 amount of endometrial markers in the test sample in any conventional manner, e.g. , by corriparison to a standard 
curve. 

In an aspect, a non-competitive method is used for the determination of one or more endometrial 
markers, with the most common method being the "sandwich" method. In this assay, two antibodies to 
endometrial markers are employed. One of the antibodies to endometrial markers is directly or indirectly labeled 
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(sometimes referred to as the "detection antibody") and the other is immobilized or immobilizable (sometimes 
referred to as the "capture antibody"). The capture and detection antibodies can be contacted simultaneously or 
sequentially with the test sample. Sequential methods can be accomplished by incubating the capture antibody 
with the sample, and adding the detection antibody at a predetermined time thereafter (sometimes referred to as 
5 the "forward" method); or the detection antibody can be incubated with the sample first and then the capture 
antibody added (sometimes referred to as the "reverse" method). After the necessary incubation(s) have occurred, 
to complete the assay, the capture antibody is separated from the liquid test mixture, and the label is measured in 
at least a portion of the separated capture antibody phase or the remainder of the liquid test mixture. Generally it 
is measured in the capture antibody phase since it comprises endometrial cancer markers bound by 

10 ("sandwiched" between) the capture and detection antibodies. In an embodiment, the label may be measured 
without separating the capture antibodies and liquid test mixture. 

In a typical two-site immunometric assay for endometrial markers, one or botli of tlie capture and 
detection antibodies are polyclonal antibodies or one or both of the capture and detection antibodies are 
monoclonal antibodies (i.e. polyclonal/polyclonal, monoclonal/monoclonal, or monoclonal/polyclonal). The label 

15 used in the detection antibody can be selected from any of those known conventionally in the art. The label may 
be an enzyme or a chemiluminescent moiety, but it can also be a radioactive isotope, a fluorophor, a detectable 
ligand (e.g., detectable by a secondary binding by a labeled binding partner for tlie ligand), and the like: In a 
particular aspect, the antibody is labelled with an enzyme which is detected by adding a substrate that is selected 
so that a reaction product of the enzyme and substrate forms fluorescent complexes. The capture antibody may be 

20 selected so that it provides a means for being separated from the remainder of the test mixture. Accordingly, the 
capture antibody can be introduced to the assay in an already immobilized or insoluble form, or can be in an 
immobilizable form, that is, a form which enables immobilization to be accomplished subsequent to introduction 
of the capture antibody to the assay. An immobilized capture antibody may comprise an antibody covalently or 
noncovalently attached to a solid phase such as a magnetic particle, a latex particle, a microtiter plate well, a 

25 bead, a cuvette, or other reaction vessel.- An example of an immobilizable capture antibody is antibody which has 
been chemically modified with a ligand moiety, e. g., a hapten, biotin, or the like, and which can be subsequently 
immobilized by contact with an immobilized form of a binding partner for the ligand, e.g., an antibody, avidin, or 
the like. In an embodiment, the capture antibody may be immobilized using a species specific antibody for the 
capture antibody that is bound to the solid phase. 

30 The above-described immunoassay methods and formats are intended to be exemplary and are not 

limiting. 

Computer Systems 

Analytic methods contemplated herein can be implemented by use of computer systems and methods 
described below and known in the art. Thus, the invention provides computer readable media comprising one or 
35 more endometrial markers, and/or polynucleotides encoding one or more endometrial markers, and optionally 
other markers (e.g. markers of endometrial cancer). "Computer readable media" refers to any medium that can be 
read and accessed directly by a computer, including but not Ihnited to magnetic storage media, such as floppy 
discs, hard disc storage medium, and magnetic tape; optical storage media such as CD-ROM; electrical storage 
media such as RAM and ROM; and hybrids of these categories such as magnetic/optical storage media. Thus, the 
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invention contemplates computer readable medium having recorded thereon markers identified for patients and 
controls. 

"Recorded" refers to a process for storing information on computer readable medium. The skilled 

r 

artisan can readily adopt any of the presently known methods for recording information on computer readable 
5 medium to generate manufactures comprising information on one or more endometrial markers, and optionally 
other markers. 

A variety of data processor programs and formats can be used to store information on one or more 
endometrial markers, and/or polynucleotides encoding one or more endometrial markers, and other markers on 
computer readable medium. For example, the information can be represented in a word processing text file, 
10 formatted in commercially-available software such as WordPerfect and MicroSoft Word, or represented in the 
form of an ASCII file, stored in a database application, such as DB2, Sybase, Oracle, or the like. Any number of 
dataprocessor structuring formats (e.g., text file or database) may be adapted in order to obtain computer readable 
medium having recorded thereon the marker information. 

By providing the marker information in computer readable form, one can routinely access the 
15 information for a variety of purposes. For example, one sldlled in the art can use the information in computer 
readable form to compare marker information obtained during or following therapy wilh the information stored 
within the data storage means. 

The invention provides a medium for holding instructions for performing a method for determining 
uterine endometrial receptivity of a patient, or whether a patient has an endometrial disease (e.g. endometrial 
20 cancer) or a pre-disposition to an endometrial disease (e.g. cancer), comprising determining the presence or 
absence of one or more endometrial markers, and/or polynucleotides encoding one or more endometrial markers, 
and optionally other markers, and based on the presence or absence of the one or more endometrial markers, 
and/or polynucleotides encoding one or more endometrial markers, and optionally other markers, determining 
uterine endometrial receptivity, endometrial disease ( e.g. cancer) or a pre-disposition to an endometrial disease 
25 (e.g. cancer), and optionally recommending a procedure or treatment. 

The invention also provides in an electronic system and/or in a network, a method for determining 
uterine endometrial receptivity of a patient, whether a subject has an endometrial disease (e.g. cancer) or a pre- 
disposition to an endometrial disease (e.g. cancer), comprising determining the presence or absence of one or 
more endometrial markers, and/or polynucleotides encoding one or more endometrial markers, and optionally 
30 otlier markers (e.g. cancer markers), and based on the presence or absence of the one or more endometrial 
markers, and/or polynucleotides encoding one or more endometrial markers, and optionally other markers, 
determining the uterine endometrial receptivity of the patient, whether the subject has an endometrial disease 
(e.g. cancer) or a pre-disposition to an endometrial disease (e.g. cancer), and optionally recommending a 
procedure or treatment. 

35 The invention further provides in a network, a method for determining whether a subj ect is receptive to 

in vitro fertilization, has an endometrial disease (e.g. cancer) or a pre-disposition to an endometrial disease (e.g. 
cancer) comprising: (a) receiving phenotypic information on the subject and information on one or more 
endometrial markers, and/or polynucleotides encoding one or more endometrial markers, and optionally other 
markers associated with samples from the subject; (b) acquiring information from the network corresponding to 
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the one or more endornetrial markers, and/or polynucleotides encoding one or more endometrial markers, and 
optionally other markers; and (c) based on tlie phenotypic information and information on the one or more 
endometrial markers, and/or polynucleotides encoding one or more endometrial markers, and optionally other 
markers, determining whether the subject is receptive to in vitro fertilization, has an endometrial disease (e.g. 
5 cancer) or a pre-disposition to an endometrial disease (e.g. cancer); and (d) optionally recommending a procedure 
or treatment 

The invention still further provides a system for identifying selected records that identify a diseased 
endometrial cell or tissue (e.g. cancer cell or tissue) or an endometrium phase. A system of the invention 
generally comprises a digital computer; a database server coupled to the computer; a database coupled to the 
10 database server having data stored therein, the data comprising records of data comprising one or more 
endometrial markers, and/or polynucleotides encoding one or more endometrial markers, and optionally other 
endometrial markers, and a code mechanism for applying queries based upon a desired selection criteria to the 
data file in the database to produce reports of records which match the desired selection criteria. 

In an aspect of the invention a method is provided for detecting endometrial cancer tissue or cells using 
15 a computer having a processor, memory, display, and input/output devices, the method comprising the steps of: 

(a) creating records of one or more endometrial cancer markers, and/or polynucleotides encoding 
one or more endometrial cancer markers, and optionally other markers of cancer identified in a 
sample suspected of containing endometrial cancer cells or tissue; 

(b) providing a database comprising records of data comprising one or more endometrial cancer 
20 markers, and/or polynucleotides encoding one or more endometrial cancer markers, and 

optionally other markers of cancer; and 

(c) using a code mechanism for applying queries based upon a desired selection criteria to the 
data file in the database to produce reports of records of step (a) which provide a match of the 
desired selection criteria of the database of step (b) the presence of a match being a positive 

25 indication that the markers of step (a) have been isolated from cells or tissue that are 

endometrial cancer cells or tissue. 
Hie invention contemplates a business method for determining whether a subject is receptive to in vitro 
fertilization, has an endometrial disease (e.g. cancer) or a pre-disposition to endometrial cancer comprising: (a) 
receiving phenotypic information on the subject and information on one or more endometrial markers, and/or 
30 polynucleotides encoding the markers, and optionally other markers, associated with samples from the subject; 
(b) acquiring information from a network corresponding to one or more endometrial markers, and/or 
polynucleotides encoding the markers, and optionally other markers; and (c) based on the phenotypic 
information, information on one or more endometrial markers, and/or polynucleotides encoding the markers, and 
optionally other markers, and acquired information, determining whether the subject is receptive to in vitro 
35 fertilization, has an endometrial disease (e.g. cancer) or a pre-disposition to an endometrial disease (e.g. cancer); 
and (d) optionally recommending a procedure or treatment. 

In an aspect of the invention, the computer systems, components, and methods described herein are used 
to monitor disease or determine the stage of disease, or determine uterine endometrial receptivity. 
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■ 

Imaging Methods 

Binding agents, in particular antibodies, specific for one or more endometrial markers may also be used 
in imaging methodologies in the management of an endometrial disease or determining uterine endometrial 
receptivity. 

5 In an aspect, the invention provides a method for imaging tumors associated with one or more 

endometrial cancer markers. 

The invention also contemplates imaging methods described herein using multiple markers for an 
endometrial disease or endometrium phase. Preferably each agent is labeled so that it can be distinguished during 
the imaging. 

10 In an embodiment the method is an in vivo method and a subject or patient is administered one or more 

agents that carry an imaging label and that are capable of targeting or binding to one or more endometrial 
markers. The agent is allowed to incubate in vivo and bind to the endometrial markers associated with 
endometrial cells or tissues of a particular phase or associated with diseased cells or tissues, (e. g. an endometrial 
tumor). The presence of the label is localized to the endometrial cells or tissues, and the localized label is 

15 detected using imaging devices known to those skilled in the art. 

The agent may be an antibody or chemical entity that recognizes the endometrial markers. In an aspect 
of the invention the agent is a polyclonal antibody or monoclonal antibody, or fragments thereof, or constructs 
thereof including but not limited to, single chain antibodies, bifunctional antibodies, molecular recognition units, 
and peptides or entities that mimic peptides. The antibodies specific for the endometrial markers used in the 

20 methods of the invention may be obtained from scientific or commercial sources, or isolated native endometrial 
markers or recombinant endometrial markers may be utilized to prepare antibodies etc. as described herein. 

An agent may be a peptide that mimics the epitope for an antibody specific for an endometrial marker 
and binds to the marker. Tlie peptide may be produced on a commercial synthesizer using conventional solid 
phase chemistry. By way of example, a peptide may be prepared that includes either tyrosine, lysine, or 

25 phenylalanine to which N2S2 chelate is complexed (See U.S. Patent No. 4,897,255). An anti-endocrine marker 
peptide conjugate is then combined with a radiolabel (e.g. sodium ^^"Tc pertechnetate or sodium ^^^Re 
perrhenate) and it may be used to locate an endometrial marker producing cell or tissue (e.g. tumor). 

The agent carries a label to image the endometrial markers. The agent may be labelled for use in 
radionuclide imaging. In particular, the agent may be directly or indirectly labelled with a radioisotope. Examples 

30 of radioisotopes that maybe used in the present invention are the following: ^"Ac, ^At, ^^^Ba, ^^*Ba, ''Be, ^"'^Bi, 

2053.^ 2063.^ 763^^ 773^^ 823^^ 109^^^ 47^^^ 11^^ 14^^ 36^.1^ 48^^^ 51^^^ 62^.^^ 64^^^ 67^^ 165^^^ ISSg^^ ISj,^ 153q^^ 66q^^ 

^'Ga, ^«Ga, ^Ga, ^^«Au, ^H, ^*^^Ho, "'in, "'">In, "^™In, '«^Ir, ^^'"^Ir, ^^^Ir, ^^^Ir, ^^Fe, ^^Fe, ^^Fe, ^^^Lu, 

191m.l91og^ 109p^^ 32p^ 33p^ 42j.^ 226^^ 186^^^ 188^^^ 82mj^b^ 153g^^ 46g^^ 41^^^ 12^^^ 75^^^ 105^^^ 22^^ 2^^^ 89^^^ 

''S, '^'Ta, ^^Tc, ^^"Tc, ^^^Tl, ^°^T1, '^^Sn, '»'"Sn, '^^Sn, ^^^Yb, '''Vh, '''Yh, «^Y, ^°Y, '^Zn and "Zn. 
35 Preferably the radioisotope is "'l, '^I, '""X ' ' 'l, ^^"Tc, ^°Y, "^Re, ^««Re, ^^P, '^^Sm, ^^Ga, ^^^Tl ^ ^ or and is 
imaged with a photoscanning device. 

Procedures for labeling biological agents with the radioactive isotopes are generally known in the art. 
U.S. Pat. No. 4,302,438 describes tritium labeling procedures. Procedures for iodinating, tritium labeling, and^^ 
S labeling especially adapted for murine monoclonal antibodies are described by Coding, J. W. (supra, pp 124- 
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126) and the references cited therein. Other procedures for iodinating biological agents, such as antibodies, 
binding portions thereof, probes, or ligands, are described in the scientific literature (see Hunter and Greenwood, 
Nature 144:945 (1962), David et al.. Biochemistry 13:1014-1021 (1974), and U.S. Pat. Nos. 3,867,517 and 
4,376,110). Iodinating procedures for agents are described by Greenwood, F. et al., Biochem. J. 89:114-123 

5 (1963); Marchalonis, J., Biochem. J. 1 13:299-305 (1969); and Morrison, M. et al., Immunochemistry, 289-297 
(1971). ^^"^ Tc-labeling procedures are described by Rhodes, B. et al. in Burchiel, S. et al. (eds.). Tumor Imaging: 
The Radioimmunochemical Detection of Cancer, New York: Masson 111- 123 (1982) and the references cited 
therein. Labelling of antibodies or fragments with technetium-99m are also described for example in U.S. Pat. 
No. 5,317,091. U.S. Pat. No. 4,478,815, U.S. Pat. No. 4,478,818, U.S. Pat No. 4,472,371, U.S. Pat. No. Re 

10 32,417, and U.S. Pat No. 4,311,688. Procedures suitable for In-labeling biological agents are described by 
Hnatowich, D, J. et al., J, Immul. Methods, 65:147-157 (1983), Hnatowich, D. et al., J. Applied Radiation, 
35:554-557 (1984), and Buckley, R. G. et al., F.E.B.S. 166:202-204 (1984). 

An agent may also be labeled with a paramagnetic isotope for purposes of an in vivo method of the 
invention. Examples of elements that are useful in magnetic resonance imaging include gadolinium, terbium, tin, 

15 iron, or isotopes thereof. (See, for example, Schaefer et al., (1989) JACC 14, 472-480; Shreve et al., (1986) 
Magn. Reson. Med. 3, 336-340; Wolf, G L., (1984) Physiol. Chem. Phys. Med. NMR 16, 93-95; Wesbey et al., 
(1984) Physiol. Chem. Phys. Med. NMR 16, 145-155; Runge et al., (1984) Invest. Radiol. 19, 408-415 for 
discussions on in vivo nuclear magnetic resonance imaging.) 

In the case of a radiolabeled agent, the agent may be administered to the patient, it is localized to the cell 

20 or tissue (e.g. tumor) having an endometrial marker with which the agent binds, and is detected or "imaged" in 
vivo using known techniques such as radionuclear scanning using e.g., a gamma camera or emission tomography. 
[See for example, A. R. Bradwell et al., "Developments in Antibody Imaging", Monoclonal Antibodies for 
Cancer Detection and Therapy, R. W. Baldwin et al., (eds.), pp. 65-85 (Academic Press 1985)]. A positron 
emission transaxial tomography scanner, such as designated Pet VI located at BrooMiaven National Laboratory, 

25 can also be used where the radiolabel emits positrons (e.g., C, F, O, and N). 

Whole body imaging techniques using radioisotope labeled agents can be used for locating diseased 
cells and tissues ( e.g. primary tumors and tumors which have metastasized). Antibodies specific for endometrial 
markers, or fragments thereof having the same epitope specificity, are bound to a suitable radioisotope, or a 
combination thereof, and administered parenterally. For endometrial cancer, administration prefeiably is 

30 intravenous. The bio- distribution of the label can be monitored by scintigraphy, and accumulations of the label 
are related to the presence of endometrial cancer cells. Whole body imaging techniques are described in U.S. Pat. 
Nos. 4,036,945 and 4,311,688. Other examples of agents useful for diagnosis and therapeutic use that can be 
coupled to antibodies and antibody fragments include metallothionein and fragments (see, U.S. Pat. No. 
4,732,864). These agents are useful in diagnosis staging and visualization of cancer, in particular endometrial 

35 cancer, so that surgical and/or radiation treatment protocols can be used more efficiently. 

An imaging agent may carry a bioluminescent or chemiluminescent label. Such labels include 
polypeptides known to be fluorescent, bioluminescent or chemiluminescent, or, that act as enzymes on a specific 
substrate (reagent), or can generate a fluorescent, bioluminescent or chemiluminescent molecule. Examples of 
bioluminescent or chemiluminescent labels include luciferases, aequorin, obelin, mnemiopsin, berovin, a 
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phenanthridinium ester, and variations thereof and combinations thereof. A substrate for the bioluminescent or 
chemiluminescent polypeptide may also be utilized in a method of the invention. For example, the 
chemiluminescent polypeptide can be luciferase and the reagent luciferin. A substrate for a bioluminescent or 
chemiluminescent label can be administered before, at the same time (e.g., in the same formulation), or after 
5 administration of the agent. 

An imaging agent may comprise a paramagnetic compound, such as a polypeptide chelated to a metal, 
e.g., a metalloporphyrin. The paramagnetic compound may also comprise a monocrystalline nanoparticle, e.g., a 
nanoparticle comprising a lanthanide (e.g., Gd) or iron oxide; or, a metal ion comprising a lanthanide. 
"Lanthanides" refers to elements of atomic numbers 58 to 70, a transition metal of atomic numbers 21 to 29, 42 

10 or 44, a Gd(III), a Mn(II), or an element comprising a Fe element. Paramagnetic compounds can also comprise a 
neodymium iron oxide (NdFeOs) or a dysprosium iron oxide (DyFeOa). Examples of elements that are useful in 
magnetic resonance imaging include gadolinium, terbium, tin, iron, or isotopes thereof. (See, for example, 
Schaefer et al., (1989) JACC 14, 472-480; Shreve et al., (1986) Magn. Reson. Med. 3, 336-340; Wolf, G L., 
(1984) Physiol, Chem. Phys. Med. NMR 16, 93-95; Wesbey et al., (1984) Physiol. Chem. Phys. Med. NMR 16, 

15 145- 1 55; Runge et al., (1984) Invest. Radiol. 19, 408-415 for discussions on in vivo nuclear magnetic resonance 
imaging.) 

An image can be generated in a method of the invention by computer assisted tomography (CAT), 
magnetic resonance spectroscopy (MRS) image, magnetic resonance imaging (MRI), positron emission 
tomography (PET), single-photon emission computed tomography (SPECT), or bioluminescence imaging (BLI) 
20 or equivalent. 

Computer assisted tomography (CAT) and computerized axial tomography (CAT) systems and devices 
well known in the art can be utilized in the practice of the present invention. ( See, for example, U.S. Patent Nos. 
6,151,377; 5,946,371; 5,446,799; 5,406,479; 5,208,581; 5,109,397). The invention may also utilize animal 
imaging modalities, such as MicroCAT.TM. (ImTek, Inc.). 

25 Magnetic resonance imaging (MRI) systems and devices well known in the art can be utilized in the 

practice of the present invention. In magnetic resonance methods and devices, a static magnetic field is applied to 
a tissue or a body in order to define an equilibrium axis of magnetic alignment in a region of interest. A radio 
frequency field is then applied to the region in a direction orthogonal to the static magnetic field direction to 
excite magnetic resonance in the region. The resulting radio fi*equency signals are then detected and processed, 

30 and the exciting radio fi*equency field is applied. The resulting signals are detected by radio-fi*equency coils that 
are placed adjacent to the tissue or area of the body of interest. (For a description of MRI methods and devices 
see, for example, U.S. PatentNos. 6,151,377; 6,144,202; 6,128,522; 6,127,825; 6,121,775; 6,119,032; 6,115,446; 
6,11 1,410; 602,891; 5,555,251; 5,455,512; 5,450,010; 5,378,987; 5,214,382; 5,031,624; 5,207,222; 4,985,678; 
4,906,931; 4,558,279). MRI and supporting devices are commercially available for example, firom Bruker 

35 Medical GMBH; Caprius; Esaote Biomedica; Fonar; GE Medical Systems (GEMS); Hitachi Medical Systems 
America; Intermagnetics General Corporation; Lunar Corp.; MagneVu; Marconi Medicals; Philips Medical 
Systems; Shimadzu; Siemens; Toshiba America Medical Systems; including imaging systems, by, e.g., Silicon 
Graphics. The invention may also utilize animal imaging modalities such as micro-MRIs. 
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Positron emission tomography imaging (PET) systems and devices well known in the art can be utilized 
in the practice of the present invention. For example, a method of the invention may use the system designated 
Pet VI located at Brookhaven National Laboratory. For descriptions of PET systems and devices see, for 
example, U.S. Pat. Nos. 6,151,377; 6,072,177; 5,900,636; 5,608,221; 5,532,489; 5,272,343; 5,103,098. Animal 
5 imaging modalities such as micro-PETs (Corcorde Microsystems, Inc.) can also be used in the invention. 

Single-photon emission computed tomography (SPECT) systems and devices well known in the art can 
be utilized in the practice of the present invention. (See, for example, U.S. Patents. Nos. 6,1 15,446; 6,072, 177; 
5,608,221; 5,600,145; 5,210,421; 5,103,098. ) The methods of the invention may also utilize animal imaging 
modalities, such as micro-SPECTs. 

10 Bioluminescence imaging includes bioluminescence, fluorescence or chemiluminescence or other 

photon detection systems and devices that are capable of detecting bioluminescence, fluorescence or 
chemiluminescence. Sensitive photon detection systems can be used to detect bioluminescent and fluorescent 
proteins externally; see, for example, Contag (2000) Neoplasia 2:41-52; Zhang (1994) Clin. Exp. Metastasis 
12: 87-92. The methods of the invention can be practiced using any such photon detection device, or variation or 

15 equivalent thereof, or in conjunction with any known photon detection methodology, including visual imaging. 
By way of example, an intensified charge-coupled device (ICCD) camera coupled to an image processor may be 
used in the present invention. (See, e.g., U.S. Pat. No. 5,650,135). Photon detection devices are also 
commercially available from Xenogen, Hamamatsue. 
Screening Methods 

20 The invention also contemplates methods for evaluating test agents or compounds for their ability to 

inhibit an endometrial disease (e.g. cancer), potentially contribute to an endometrial disease (e.g. cancer), or 
inhibit or enhance an endometrium phase. Test agents and compounds include but are not limited to peptides 
such as soluble peptides including Ig-tailed fusion peptides, members of random peptide libraries and 
combinatorial chemistry-derived molecular libraries made of D- and/or L-configuration amino acids, 

25 phosphopeptides (including members of random or partially degenerate, directed phosphopeptide libraries), 
antibodies [e.g. polyclonal, monoclonal, humanized, anti-idiotypic, chimeric, single chain antibodies, fragments, 
(e.g. Fab, F(ab)2, and Fab expression library fragments, and epitope-binding fragments thereof)], and small 
organic or inorganic molecules. The agents or compounds may be endogenous physiological compounds or 
natural or synthetic compounds. 

30 The invention provides a method for assessing the potential efficacy of a test agent for inhibiting an 

endometrial disease (e.g. cancer) in a patient, the method comprising comparing: 

(a) levels of one or more endometrial markers, and/or polynucleotides encoding endometrial 
markers, and optionally other markers in a first sample obtained from a patient and exposed to 
the test agent; and 

35 (b) levels of one or more endometrial markers and/or polynucleotides encoding endometrial 

markers, and optionally other markers, in a second sample obtained from the patient, wherein 
the sample is not exposed to the test agent, wherein a significant difference in the levels of 
expression of one or more endometrial markers, and/or polynucleotides encoding one or more 
endometrial markers, and optionally the other markers, in the first sample, relative to the 
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second sample, is an indication that the test agent is potentially efficacious for inhibiting an 
endometrial disease (e.g. cancer) in the patient. 
The first and second samples may be portions of a single sample obtained from a patient or portions of 
pooled samples obtained from a patient. 
5 In an aspect, the invention provides a method of selecting an agent for inhibiting ah endometrial disease 

(e.g. cancer) in a patient comprising: 

(a) obtaining a sample from the patient; 

(b) separately maintaining aliquots of the sample in the presence of a plurality of test agents; 

(c) comparing one or more endometrial markers, and/or polynucleotides encoding endometrial 
10 markers, and optionally other markers, in each of the aliquots; and 

(d) selecting one of the test agents which alters the levels of one or more endometrial markers, 
and/or polynucleotides encoding endometrial markers, and optionally other markers in the 
aliquot containing that test agent, relative to other test agents. 

In a further aspect, the invention provides a method of selecting an agent for inhibitmg or enhancing an 
15 endometrium phase in a patient comprising: 

(a) obtaining a sample of endometrium in a selected phase (e.g. secretory or proliferative phase); 

(b) separately maintaining aUquots of the sample in the presence of a plurality of test agents; 

(c) comparing one or more endometrial markers, and/or polynucleotides encoding endometrial 
markers, and optionally other markers, in each of the aliquots; and 

20 (d) selecting one of the test agents which alters the levels of one or more endometrial markers, 

and/or polynucleotides encoding endometrial markers, and optionally other markers in the 
aliquot containing that test agent, relative to other test agents. 
Still another aspect of the present invention provides a method of conducting a drug discovery business 
comprising: 

25 (a) providing one or more methods or assay systems for identifying agents that inhibit an 

endometrial disease (e.g. endometrial cancer) or affect an endometrium phase in a patient; 

(b) conducting therapeutic profiling of agents identified in step (a), or further analogs thereof, for 
efficacy and toxicity in animals; and 

(c) formulating a pharmaceutical preparation including one or more agents identified in step (b) as 
30 having an acceptable therapeutic profile. 

In certain embodiments, the subject method can also include a step of establishing a distribution system 
for distributing the pharmaceutical preparation for sale, and may optionally include establishing a sales group for 
marketing the pharmaceutical preparation. 

The invention also contemplates a method of assessing the potential of a test compound to contribute to 
35 an endometrial disease (e.g. endometrial cancer) comprising: 

(a) maintaining separate aliquots of cells or tissues from a patient with an endometrial disease 
(e.g. cancer) in the presence and absence of the test compound; and 

(b) comparing one or more endometrial markers, and/or polynucleotides encoding endometrial 
markers, and optionally other markers in each of the aliquots. 
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A sigaificant difference between the levels of the markers in the aliquot maintained in the presence of 
(or exposed to) the test compound relative to the aliquot maintained in the absence of the test compound, 
indicates that the test compound possesses the potential to contribute to an endometrial disease (e.g. endometrial 
cancer). 
5 Kits 

The invention also contemplates kits for carrying out the methods of the invention. Kits may typically 
comprise two or more components required for performing a diagnostic assay. Components include but are not 
limited to compounds, reagents, containers, and/or equipment. 

The methods described herein may be performed by utilizing pre-packaged diagnostic kits comprising 

10 one or more specific endometrial marker polynucleotide or antibody described herein, which may be 
conveniently used, e.g., in clinical settings to screen and diagnose patients and to screen and identify those 
individuals exhibiting a predisposition to developing an endometrial disease. 

In an embodiment, a container with a kit comprises a binding agent as described herein. By way of 
example, the kit may contain antibodies or antibody fragments which bind specifically to epitopes of one or more 

15 endometrial markers and optionally other markers, antibodies against the antibodies labelled with an enzyme; 
and a substrate for the enzyme. The kit may also contain microtiter plate wells, standards, assay diluent, wash 
buffer, adhesive plate covers, and/or instructions for carrying out a method of the invention using the Icit. 

In an aspect of tlie invention, the kit includes antibodies or fragments of antibodies which bind 
specifically to an epitope of one or more protein listed in Table 1, 4, 5, or 6 and means for detecting binding of 

20 the antibodies to their epitope associated with tumor cells, either as concentrates (including lyophilized 
compositions), which may be further diluted prior to use or at the concentration of use, where the vials may 
include one or more dosages. Where the kits are intended for in vivo use, single dosages may be provided in 
sterilized containers, having the desired amount and concentration of agents. Containers that provide a 
formulation for direct use, usually do not require other reagents, as for example, where the kit contains a 

25 radiolabelled antibody preparation for iji vivo imaging. 

A kit may be designed to detect the level of polynucleotides encoding one or more endometrial 
polynucleotide markers in a sample. In an embodiment, the polynucleotides encode one or more polynucleotides 
encoding a polypeptide listed in Table 1, 4, 5 or 6. Such kits generally comprise at least one oligonucleotide 
probe or primer, as described herein, that hybridizes to a polynucleotide encoding one or more endometrial 

30 cancer markers. Such an oligonucleotide may be used, for example, within a PCR or hybridization procedure. 
Additional components that may be present within the kits include a second oligonucleotide and/or a diagnostic 
reagent or containerto facilitate detection of a polynucleotide encoding one or more endometrial cancer markers. 

The invention provides a kit containing a micoarray described herein ready for hybridization to target 
endometrial polyniicleotide markers, plus software for the data analysis of the results. The software to be 

35 included with the kit comprises data analysis methods, in particular mathematical routines for marker discovery, 
including the calculation of correlation coefficients between clinical categories and marker expression. The 
software may also include mathematical routines for calculating the correlation between sample marker 
expression and control marker expression, using array-generated fluorescence data, to determine the clinical 
classification of the sample. 
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The reagents suitable for applying the screening methods of the invention to evaluate compounds may 
be packaged into convenient kits described herein providing the necessary materials packaged into suitable 
containers. 

The invention contemplates a kit for assessing the presence of endometrial cells, wherein the kit 
5 comprises antibodies specific for one or more endometrial markers, or primers or probes for polynucleotides 
encoding same, and optionally probes, primers or antibodies specific for other markers associated with an 
endometrial disease (e.g. cancer). 

The invention relates to a kit for assessing the suitability of each of a plurality of test compounds for 
inhibiting an endometrial disease (e.g. endometrial cancer) in a patient. The kit comprises reagents for assessing 
1 0 one or more endometrial markers or polynucleotides encoding same, and optionally a plurality of test agents or 
compounds. 

Additionally the invention provides a kit for assessing the potential of a test compoimd to contribute to 
an endometrial disease (e.g. cancer). The kit comprises endometrial diseased cells (e.g. cancer cells) and reagents 
for assessing one or more endometrial markers, polynucleotides encoding same, and optionally other markers 
15 associated with an endometrial disease. 
Therapeutic Applications 

One or more endometrial markers may be targets for immunotherapy. Immunotherapeutic methods 
include the use of antibody therapy, in vivo vaccines, and ex vivo immunotherapy approaches. 

In one aspect, the invention provides one or more endometrial marker antibodies that may be used 
20 systemically to treat an endometrial disease associated with the marker. In particular, the endometrial disease is 
endometrial cancer and one or more endometrial marker antibodies may be used systemically to treat endometrial 
cancer. Preferably antibodies are used tliat target the tumor cells but not the surrounding non-tumor cells and 
tissue. 

Thus, the invention provides a method of treating a patient susceptible to, or having a disease (e.g. 
25 cancer) that expresses one or more endometrial marker (in particular a marker up-regulated in endometrial 
cancer, for example, an up-regulated marker in Table 1), comprising administering to the patient an effective 
amoimt of an antibody that binds specifically to one or more endometrial marker. 

In another aspect, the invention provides a method of inhibiting the growth of tumor cells expressing 
one or more endometrial cancer markers, comprising administering to a patient an antibody which binds 
30 specifically to one or more endometrial cancer markers in an amount effective to inhibit growth of the tumor 
cells. 

One or more endometrial marker antibodies may also be used in a method for selectively inhibiting the 
growth of, or killing a cell expressing one or more endometrial marker (e.g. tumor cell expressing one or more 
endometrial cancer marker) comprising reacting one or more endometrial marker antibody immunoconjugate or 
35 immunotoxin with the cell in an amount sufficient to inhibit the growth of, or kill the cell. 

By way of example, unconjugated antibodies to endometrial cancer markers maybe introduced into a 
patient such that the antibodies bind to endometrial cancer marker expressing cancer cells and mediate growth 
inhibition of such cells (including the destruction thereof), and the tumor, by mechanisms which may include 
complement-mediated cytolysis, antibody-dependent cellular cytotoxicity, altering the physiologic function of 
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one or more endometrial cancer markers, and/or the inhibition of ligand binding or signal transduction pathways. 
In addition to unconjugated antibodies to endometrial cancer markers, one or more endometi ial cancer marker 
antibodies conjugated to therapeutic agents (e.g. immunoconjugates) may also be used therapeutically to deliver 
the agent directly to one or more endometrial cancer marker expressing tumor cells and thereby destroy the 
5 tumor. Examples of such agents include abrin, ricin A, Pseudomonas exotoxin, or diphtheria toxin; proteins such 
as tumor necrosis factor, alpha-interferon, beta-interferon, nerve growth factor, platelet derived growth factor, 
tissue plasminogen activator; and biological response modifiers such as lymphokines, interleukin-I, interleukin- 
2, interleukin-6, granulocyte macrophage colony stimulating factor, granulocyte colony stimulating factor, or 
otlier growtii factors. 

1^ Cancer immunotherapy using one or more endometrial cancer marker antibodies may utilize the various 

approaches that have been successfully employed for cancers, including but not limited to colon cancer (Arlen et 
al., 1998, Crit Rev Immmiol 18: 133-138), multiple myeloma (Ozaki et al., 1997, Blood 90: 3179-3186; 
Tsunenati et al., 1997, Blood 90: 2437-2444), gastric cancer (Kasprzyk et al., 1992, Cancer Res 52: 2771-2776), 
B-cell lymphoma (Funakoshi et al., 1996, J Immunther Emphasis Tumor Immunol 19: 93-101), leukemia (Zhong 
15 et al, 1996, LeukRes 20: 58 1-589), colorectal cancer (Moun et al., 1994, Cancer Res 54: 6160-6166); Velders et 
al., 1995, Cancer Res 55: 4398-4403), and breast cancer (Shepard et al, 1991, J Clin Immunol 11: 117-127). 

In the practice of a method of the invention, endometrial cancer marker antibodies capable of inliibiting 
the growth of cancer cells expressing endometrial cancer markers are administered in a therapeutically effective 
amount to cancer patients whose tumors express or overexpress one or more endometrial cancer markers. The 
20 invention may provide a specific, effective and long-needed treatment for endometrial cancer. The antibody 
therapy methods of the invention may be combined with other therapies including chemotherapy and radiation. 

Patients may be evaluated for the presence and level of expression or overexpression of one or more 
endometrial markers in diseased cells and tissues (e.g. tumors), in particular using immunohistochemical 
assessments of tissue, quantitative imaging as described herein, or other techniques capable of reliably indicating 
25 the presence and degree of expression of one or more endometrial markers. Immunohistochemical analysis of 
tumor biopsies or surgical specimens may be employed for this purpose. 

Endometrial marker antibodies useful in treating disease (e.g. cancer) include those that are capable of 
initiating a potent immune response against the disease ( e.g. tumor) and those that are capable of direct 
cytotoxicity. In this regard, endometrial marker antibodies may elicit cell lysis by either complement-mediated or 
30 antibody-dependent cell cytotoxicity (ADCC) mechanisms, both of which require an intact Fc portion of the 
immunoglobulin molecule for interaction with effector cell Fc receptor sites or complement proteins. 

Endometrial marker antibodies that exert a direct biological effect on tumor growth may also be useful 
in the practice of the invention. Such antibodies may not require the complete immunoglobulin to exert the 
effect. Potential mechanisms by which such directly cytotoxic antibodies may act include inhibition of cell 
35 growth, modulation of cellular differentiation, modulation of tumor angiogenesis factor profiles, and the 
induction of apoptosis. The mechanism by which a particular antibody exerts an anti-tumor effect may be 
evaluated using any number of in vitro assays designed to determine ADCC, antibody- dependent macrophage- 
mediated cytotoxicity (ADMMC), complement-mediated cell lysis, and others known in the art. 
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The anti-tumor activity of a particular endometrial cancer marker antibody, or combination of 
endometrial cancer marker antibodies, may be evaluated in vivo using a suitable animal model. Xenogenic cancer 
models, where human cancer explants or passaged xenograft tissues are introduced into immune compromised 
animals, such as nude or SCID mice, may be employed. 

5 The methods of the invention contemplate the administration of single endometrial marker antibodies as 

well as combinations, or "cocktails", of different individual antibodies such as those recognizing different 
epitopes of other markers. Such cocktails may have certain advantages inasmuch as they contain antibodies that 
bind to different epitopes of endometrial markers and/or exploit different effector mechanisms or combine 
directly cytotoxic antibodies with antibodies that rely on immune effector functionality. Such antibodies in 

10 combination may exhibit synergistic therapeutic effects. In addition, the administration of one or more 
endometrial marker specific antibodies may be combined with other therapeutic agents, including but not limited 
to chemotherapeutic agents, androgen-blockers, and immune modulators (e.g., IL2, GM-CSF). The endometrial 
marker specific antibodies may be administered in their "naked" or unconj ugated form, or may have flierapeutic 
agents conjugated to them. 

15 The endometrial marker specific antibodies used in the methods ofthe invention may be formulated into 

pharmaceutical compositions comprising a carrier suitable for the desired delivery method. Suitable carriers 
include any material which when combined with the antibodies retains the function ofthe antibody and is non- 
reactive with the subject's immune systems. Examples include any of a number of standard pharmaceutical 
carriers such as sterile phosphate buffered saline solutions, bacteriostatic water, and the like (see, generally, 

20 Remington*s Pharmaceutical Sciences 16.sup.th Edition, A. OsaL, Ed., 1980). 

One or more endometrial marker specific antibody formulations may be administered via any route 
capable of delivering the antibodies to the a disease (e.g. tumor) site. Routes of administration include, but are 
not limited to, intravenous, intraperitoneal, intramuscular, intratumor, intradermal, and the like. Preferably, the 
route of administration is by intravenous injection. Antibody preparations may be lyophilized and stored as a 

25 sterile powder, preferably under vacuum, and then reconstituted in bacteriostatic water containing, for example, 
benzyl alcohol preservative, or in sterile water prior to injection. 

Treatment will generally involve the repeated administration of the antibody preparation via an 
acceptable route of administration such as intravenous injection (IV), at an effective dose. Dosages will depend 
upon various factors generally appreciated by those of skill in Hie art, including the type of disease and the 

30 severity, grade, or stage ofthe disease, the binding affinity and half life ofthe antibodies used, the degree of 
endometrial marker expression in the patient, the extent of circulating endometrial markers, the desired steady- 
state antibody concentration level, frequency of treatment, and the influence of any chemotherapeutic agents used 
in combination with tlie treatment method ofthe invention. Daily doses may range from about 0. 1 to 100 mg/kg. 
Doses in the range of 10-500 mg antibodies per week may be effective and well tolerated, although even higher 

35 weekly doses may be appropriate and/or well tolerated. A determining factor in defining the appropriate dose is 
the amount of a particular antibody necessary to be therapeutically effective in a particular context. Repeated 
administrations may be required to achieve disease inhibition or regression. Direct administration of one or more 
endometrial marker antibodies is also possible and may have advantages in certain situations. 

Patients maybe evaluated for serum cancer markers in order to assist in the determination ofthe most 
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effective dosing regimen and related factors. The endometrial cancer assay methods described herein, or similar 
assays, may be used for quantitating circulating endometrial marker levels in patients prior to treatment. Such 
assays may also be used for monitoring throughout therapy, and may be useful to gauge therapeutic success in 
combination with evaluating other parameters such as serum levels of endometrial markers. 
5 The invention further provides vaccines formulated to contain one or more endometrial marker or 

fragment thereof 

In an embodiment, the invention provides a method of vaccinating an individual against one or more 
endometrial marker listed in Table 1 comprising the step of inoculating the individual with the marker or 
fragment thereof that lacks activity, wherein the inoculation elicits an immune response in the individual thereby 

10 vaccinating the individual against the marker. 

The use in anti-cancer therapy of a tumor antigen in a vaccine for generating humoral and cell-mediated 
immunity is well known and, for example, has been employed in prostate cancer using human PSMA and rodent 
PAP immunogens (Hodge et al., 1995, Int. J. Cancer 63: 231-237; Fong et al., 1997, J. Immunol. 159: 3113- 
3117). These and similar methods can be practiced by employing one or more endometrial markers, or fragment 

15 thereof, or endometrial pol3^ucleotide markers and recombinant vectors capable of expressing and appropriately 
presenting endometrial marker immunogens. 

By way of example, viral gene delivery systems may be used to deliver one or more endometrial 
polynucleotide markers. Various viral gene delivery systems which can be used in the practice of this aspect of 
the invention include, but are not limited to, vaccinia, fowlpox, canarypox, adenovirus, influenza, poliovirus, 

20 adeno-associated virus, lentivirus, and sindbus virus (Restifo, 1996, Curr. Opin. Immunol. 8: 658-663). Non- viral 
delivery systems may also be employed by using naked DNA encoding one or more endometrial cancer marker 
or fragment thereof introduced into the patient (e.g., intramuscularly) to induce an anti-tumor response. 

Various ex vivo strategies may also be employed. One approach involves the use of cells to present one 
or more endometrial marker to a patient's immune system. For example, autologous dendritic cells which express 

25 MHC class I and II, may be pulsed with one or more endometrial marker or peptides thereof that are capable of 
binding to MHC molecules, to thereby stimulate the patients' immime systems (See, for example, Tjoa et al., 
1996, Prostate 28: 65-69; Murphy et al., 1996, Prostate 29: 371-380), 

Anti-idiotypic endometrial marker specific antibodies can also be used in therapy as a vaccine for 
inducing an immune response to cells expressing one or more endometrial marker. The generation of anti- 

30 idiotypic antibodies is well known in the art and can readily be adapted to generate anti-idiotypic endometrial 
cancer marker specific antibodies that mimic an epitope on one or more endometrial cancer markers (see, for 
example, Wagner et al., 1997, Hybridoma 16: 33-40; Foon et al., 1995, J Clin Invest 96: 334-342; Herlyn et al., 
1996, Cancer Immunol Immunother 43: 65-76). Such an antibody can be used in anti-idiotypic therapy as 
presently practiced with other anti-idiot5/pic antibodies directed against antigens associated with disease (e.g. 

35 tumor antigens). 

Genetic immunization methods may be utilized to generate prophylactic or therapeutic humoral and 
cellular immune responses directed against cells expressing one or more endometrial cancer marker. One or more 
DNA molecules encoding endometrial markers, constructs comprising DNA encoding one or more endometrial 
markers/inununogens and appropriate regulatory sequences may be injected directly into muscle or skin of an 



wo 2005/061725 



PCT/CA2004/002172 



-49- 

individual, such that the cells of the muscle or skin take-up the construct and express the encoded endometrial 
markers/immunogens. The endometrial markers/immunogens may be expressed as cell surface proteins or be 
secreted. Expression of one or more endometrial markers results in the generation of prophylactic or therapeutic 
humoral and cellular inununity against the disease (e.g. cancer). Various prophylactic and therapeutic genetic 
5 immunization techniques known in the art may be used. 

The invention further provides methods for inhibiting cellular activity (e.g. , cell proliferation, activation, 
or propagation) of a cell expressing one or more endometrial marker. This method comprises reacting 
immunoconjugates of the invention (e.g., a heterogeneous or homogenous mixture) with the cell so that 
endometrial markers form complexes with the immunoconjugates. A subject with a neoplastic or preneoplastic 

10 condition can be treated when the inhibition of cellular activity results in cell death. 

In another aspect, the invention provides methods for selectively inhibiting a cell expressing one or 
more endometrial marker by reacting any one or a combination of the immunoconjugates of the invention with 
the cell in an amount sufficient to inhibit the cell. Amounts include those that are sufficient to kill the cell or 
sufficient to inhibit cell growth or proliferation. 

15 Vectors derived from retroviruses, adenovirus, herpes or vaccinia viruses, or from various bacterial 

plasmids, may be used to deliver polynucleotides encoding endometrial cancer markers to a targeted organ, 
tissue, or cell population. Methods well known to those skilled in the art may be used to construct recombinant 
vectors that will express antisense polynucleotides for endometrial markers. (See, for example, the techniques 
described in Sambrook et al (supra) and Ausubel et al (supra)). 

20 Methods for introducing vectors into cells or tissues include those methods discussed herein and which 

are suitable for in vivo, in vitro and ex vivo therapy. For ex vivo therapy, vectors may be introduced into stem 
cells obtained from a patient and clonally propagated for autologous transplant into the same patient (See U.S. 
Pat. Nos. 5,399,493 and 5,437,994). Delivery by transfection and by liposome are well known in the art. 

Genes encoding endometrial markers can be turned off by transfecting a cell or tissue with vectors that 

25 express high levels of a desired endometrial marker-encoding fragment. Such constructs can inundate cells with 
untranslatable sense or antisense sequences. Even in the absence of integration into the DNA, such vectors may 
continue to transcribe RNA molecules until all copies are disabled by endogenous nucleases. 

Modifications of gene expression can be obtained by designing antisense molecules, DNA, RNA or 
PNA, to the regulatory regions of a gene encoding an endometrial marker, i.e., the promoters, enhancers, and 

30 introns. Preferably, oligonucleotides are derived from the transcription initiation site, e.g. between - 10 and +10 
regions of the leader sequence. The antisense molecules may also be designed so that they block translation of 
mRNA by preventing the transcript from binding to ribosomes. Inhibition may also be achieved using "triple 
helix" base-pairing methodology. Triple helix pairing compromises the ability of the double helix to open 
sufficiently for the binding of polymerases, transcription factors, or regulatory molecules. Therapeutic advances 

35 using triplex DNA were reviewed by Gee J E et al (In: Huber B E and B I Carr (1994) Molecular and 
Immunologic Approaches, Futura Publishing Co, Mt Kisco N. Y.). 

Ribozjones are enzymatic RNA molecules that catalyze the specific cleavage of RNA. Ribozymes act 
by sequence-specific hybridization of the ribozyme molecule to complementary target RNA, followed by 
endonucleolytic cleavage. The invention therefore contemplates engineered hammerhead motif ribozyme 
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molecules that can specifically and efficiently catalyze endonucleolytic cleavage of sequences encoding an 
endometrial marker. 

Specific ribozyme cleavage sites within any potential RNA target may initially be identified by scanning 
the target molecule for ribozyme cleavage sites which include the following sequences, GUA, GUU and GUC. 
5 Once the sites are identified, short RNA sequences of between 15 and 20 ribonucleotides corresponding to the 
region of the target gene containing the cleavage site may be evaluated for secondary structural features which 
may render the oligonucleotide inoperable. The suitability of candidate targets may also be determined by testing 
accessibility to hybridization with complementary oligonucleotides using ribonuclease protection assays. 

One or more endometrial markers and polynucleotides encoding the markers, and fragments thereof, 
10 may be used in the treatment of an endometrial disease (e.g. endometrial cancer) in a subject. In an aspect the 
endometrial markers and polynucleotides encoding the markers are endometrial cancer markers that are down- 
regulated in endometrial cancer, for example, a down-regulated marker in Table 1. The markers or 
polynucleotides may be formulated into compositions for administration to subjects suffering from an 
endometrial disease. Therefore, the present invention also relates to a composition comprising one or more 
15 endometrial markers or polynucleotides encoding the markers, or a fragment thereof, and a pharmaceutically 
acceptable carrier, excipient or diluent. A method for treating or preventing an endometrial disease in a subject is 
also provided comprising administering to a patient in need thereof, one or more endometrial markers or 
polynucleotides encoding the markers, or a composition of the invention. 

The invention further provides a method of inhibiting an endometrial disease (e.g. endometrial cancer) 
20 in a patient comprising: 

(a) obtaining a sample comprising diseased cells from the patient; 

(b) separately maintaining aliquots of the sample in the presence of a plurality of test agents; 

(c) comparing levels of one or more endometrial markers, and/or polynucleotides encoding one or 
more endometrial markers in each aliquot; 

25 (d) administering to the patient at least one of the test agents which alters the levels of the 

endometrial markers, and/ or polynucleotides encoding one or more endometrial markers in the 
aliquot containing that test agent, relative to the other test agents. 
Endometrial markers in uterine biopsy tissue or fluid and sera may vary between known fertile and 
infertile women during the window of implantation, deviate in women undergoing ovarian 
30 hyperstimulation/ovulation induction, and correlate with successful initiation of pregnancy. Therefore, 
endometrial markers of the invention may serve as minimally or noninvasive markers of uterine receptivity for 
implantation. 

The present invention further provides a method of determining uterine endometrial receptivity by first 
obtaining a serum, uterine fluid or endometrial biopsy sample from a patient and detecting the presence of an 
35 endometrial marker associated with a certain endometrium phase, wherein the presence or absence of an 
endometrial marker as compared to controls indicates uterine receptivity. In an embodiment, the endometrium 
phase is the secretory phase. Where necessary for the evaluation, repetitive samples maybe collected tliroughout 
the menstrual cycle. Non-receptive controls are both women who are in the non-fertile stage of the menstrual 
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cycle and women with known uterine dysfunction where an endometrial marker is not present or present on the 
endometrium throughout the menstrual cycle or certain endometrium phases. 

The.present invention further provides a method of monitoring the effects of ovarian hyperstimulation 
and/ or ovulation induction protocols on uterine receptivity either for individual women receiving the treatment or 
5 for the evaluation of new protocols. In an embodiment, the method comprises: (a) obtaining a serum, uterine or 
fluid or endometrial biopsy sample from a patient receiving the treatments; and (b) detecting the presence of an 
endometrial marker of the invention present in the endometrium at the time of fertilization, early embryogenesis, 
and implantation; wherein presence or absence of an endometrial marker indicates receptivity. A disruption of 
the normal cyclic presence of an endometrial marker indicates that the treatment may adversely affect uterine 
10 receptivity. This disruption may include non-cyclic presence of an endometrial marker or an aberrant presence of 
an endometrial marker as compared to controls. 

• In an aspect the invention provides a method of determining a probability of successful implantation 
with an ovarian stimulation in vitro fertilization and embryo transfer procedure, comprising: 

(a) determining a level of an endometrial marker identified in accordance with a method of the 
15 invention in a sample obtained from a patient who has undergone an ovarian stimulation in 

vitro fertilization and embryo transfer procedure; and 

(b) determining a probability of successful implantation based on the patient's determined 
endometrial marker level; 

wherein a significantly different endometrial marker level relative to a standard level is associated with a 

20 decreased or increased probability of successful implantation. 

The present invention further provides a method of contraception by interrupting the cyclic presence of 
an endometrial marker. The interruption can be to reduce or eliminate a marker present during the uterine 
receptivity window for implantation of the menstrual cycle and to thereby alter the cyclic presence/pattern of a 
marker. The interruption can utilize an antagonist of a marker. The term antagonist or antagonizing is used in its 

25 broadest sense. Antagonism can include any mechanism or treatment that results in inhibition, inactivation, 
blocking or reduction or alteration of cyclic presence of an endometrial marker. 

An active therapeutic substance described herein may be administered in a convenient manner such as 
by injection (subcutaneous, intravenous, etc.), oral administration, inhalation, transdermal application, or rectal 
administration. Depending on the route of administration, the active substance may be coated in a material to 

30 protect the substance from the action of enzymes, acids and other natural conditions that may inactivate the 
substance. Solutions of an active compound as a free base or pharmaceutically acceptable salt can be prepared in 
an appropriate solvent with a suitable surfactant. Dispersions may be prepared in glycerol, liquid polyethylene 
glycols, and mixtures thereof, or in oils. 

The compositions described herein can be prepared by per se known methods for the preparation of 

35 pharmaceutically acceptable compositions which can be administered to subjects, such that an effective quantity 
of the active substance is combined in a mixture with a pharmaceutically acceptable vehicle. Suitable vehicles 
are described, for example, in Remington's Pharmaceutical Sciences (Remington's Pharmaceutical Sciences, 
Mack Publishing Company, Easton, Pa., USA 1985). On this basis, the compositions include, albeit not 
exclusively, solutions of the active substances in association with one or more pharmaceutically acceptable 
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vehicles or diluents, and contained in buffered solutions with a suitable pH and iso-osmotic with the 
physiological fluids. 

The compositions are indicated as therapeutic agents either alone or in conjunction with other 
therapeutic agents or other forms of treatment. The compositions of the invention may be administered 
5 concurrently, separately, or sequentially with other therapeutic agents or therapies. 

The therapeutic activity of compositions and agents/compounds identified using a method of the 
invention and may be evaluated in vivo using a suitable animal model. 

The following non-limiting examples are illustrative of the present invention: 
Example 1 

10 Isotope-coded affmity tag (ICAT) analysis with a cleavable tag was used to examine differentially 

expressed proteins in proliferative and secretory endometria. Sample complexity of the tissue homogenates was 
reduced by strong cation exchange (SCX) fractionation and subsequent affinity cleanup. Analysis often SCX 
fractions in triplicate by nanobore liquid chromatography - tandem mass spectrometry (nanoLC-MS/MS) 
resulted in the identification of approximately 400 labeled proteins and the discovery of potential biomarkers for 

15 the secretory phase of the endometrial cycle. This study demonstrated the feasibility of using this approach for 
identifying jnarkers at a proteomics level for different stages of the endometrial cycle. 
. Tissue Samples 

Endometrium tissue was retrieved fi:om an in-house dedicated, research endometrial tissue bank. All 
tissues were snap firozen in liquid nitrogen within 15-20 minutes of devitalization at the time of hysterectomy, 

20 and were obtained with patient consent. In each case, the endometrium was classified as proliferative or secretory 
by a pathologist. The histological classification was verified by examination of a histopathologic section fi-om the 
frozen research tissue. Tissue was taken for proteomic analysis from the mirror-face of the residual block. After 
addition of 1 ml Hanks' Balanced Salt Solution containing protease inhibitors (leupeptin, aprotinin, pepstatin at 1 
|xg/mL), the tissue was mechanically homogenized at 30,000 rpms using a Polytron PT 1300D handheld 

25 homogenizer (Brinkmann, Westbury, USA). The samples were stored in aliquots at -80°C and/or submitted for 
protein profiling. These whole tissue homogenates contain endometrial epithelium, supportive stroma and 
vessels, as well as any secretions. Tissue samples ft*om six different individuals were selected for the study. 
Three of these tissues were classified as proliferative endometria (PROl, PR02 and PROS) and the other three as 
secretory endometria (SECI, SEC2, SEC3). 

30 Chemicals 

Acetonitrile, formic acid, potassium chloride, monobasic potassium phosphate, leupeptin, aprotinin, 
pepstatin and Hanks' Balanced Salt Solution were obtained from Sigma- Aldrich (Oakville, Canada). All 
reagents and buffers for the cleavable ICAT sample preparation procedure were from Applied Biosystems 
(Foster City, USA). 
35 ICAT Sample Preparation Procedure 

After removal of cell debris by centrifiigation, the total protein content for each of the six clarified 
homogenates was measured using a commercially available Bradford protein assay reagent (Bio-Rad, Hercules, 
USA). ICAT sample preparation procedure was carried out according to the cleavable ICAT protocol (Applied 
Biosystems, Foster City, USA) and is illustrated in Figure 1. Following denaturing and reducing steps, 100 (xg 
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total protein of each of the samples was labeled with either the light ICAT reagent (proliferative samples) or the 
heavy reagent (secretory samples). The labeled PRO 1 and SECl samples were combined to form ICAT Sample 
A, PR02 and SEC2 form ICAT Sample B, and lastly PR03 and SEC3 form ICAT Sample C. Mixing of the 
labeled proliferative and secretory samples in pairs in this manner ensures that any protein or peptide losses 
5 during subsequent processing steps is the same for both samples in a pair. Since the peptides are differentially, 
labeled, they can be traced to specific samples in the pair. Any difference detected in the levels of individual 
peptides can then be ascribed solely to initial differences in expression level. Chromatographic separation using a 
strong cation exchange (SCX) column was performed after trypsin digestion to fractionate the ICAT samples. 
Selected SCX fractions were purified by affinity chromatography according to the ICAT protocol, and 

10 subsequently analyzed by nanobore liquid chromatography-tandem mass spectrometry (nanoLC-MS/MS). 
Instrumentation for SCX fractionation 

SCX fractionation was performed on an HP1050 LC system (Agilent, Palo Alto, USA) using a 1.5 ml 
injection loop and a SF-2120 Super Fraction Collector (Advantec MPS, Dublin, USA). 
LC/MS/MS Instrumentation 

15 The LC system from LC Packmgs (Amsterdam, The Netherlands) consisted of a Famos autosampler 

and Ultimate Nano LC system. It was interfaced to an API QSTAR Pulsar QqTOF mass spectrometer (Applied 
Biosystems/MDS Sciex, Foster City, USA) using a Protana NanoES ion source (Protana Engineering A/S, 
Odense, Denmark). PicoTip's SilicaTip emitters with a 10 \xm tip i.d. (New Objective, Wobum, USA) were used 
as spray capillaries. All data were acquired using Analyst QS SP5 with Bioanalyst Extension 1.1 and analyzed 

20 with ProICAT SP2 software (Applied Biosystems/MDS Sciex, Foster City, USA). 
LC Conditions 

Strong cation exchange chromatography (SCX) was performed using a PolyLC Polysulfoethyl A 
column (The Nest Group, Southborough, USA) equipped with a guard column of the same material with the 
following dhnensions: 5-M.m particle size, 300-A bead, 2. 1-mm i.d., 10-mm length (guard column) and 100-mm 
25 length (analytical column). Eluent A of the mobile phase consisted of a 10 mM KH2PO4 solution in 25% 
acetonitrile and 75% deionized water (pH = 3.0). Eluent B consisted of a 10 mM KH2PO4 and 350 mM KCl 
solution in 25% acetonitrile and 75% deionized water (pH = 3 .0). In each case, 1 .5 ml of the total 2.4 ml sample 
(after acidifying with 2 ml Eluent A) was injected. Fractions were collected every 2 mins at a flow rate of 0.2 
ml/min. using a binary gradient with the following profile: 



30 



/[min] 0.01 2 58 60 

cfEluent B) [%} 0 0 100 stop 



All reversed-phase separations were performed using PepMap C18 nano capillary columns (LC 
Packings, Amsterdam, The Netherlands) with the following dimensions: 3-)im particle size, 100-Abead, 75-\xm. 
i.d. and 150-mm lengdi. Eluent A of the mobile phase consisted of 950 ml deionized water, 50 ml acetonitrile 
35 and 1 ml formic acid (pH w 3). Eluent B consisted of 50 ml deionized water, 950 ml acetonitrile and 1 ml formic 
acid. A binary gradient at a flow rate of approximately 200 nl/min with the following profile was used: 
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t[mm\ 0.01 5 125 135 157 160 163 190 

c(Eluent B) [YoJ 5 5 30 60 80 80 5 Stop 



Injections of 1 |xl of sample were performed in full loop mode. 
MS Conditions 

The source conditions were a curtain-gas setting of 20 and an ionspray voltage (in the range of 2000 - 
5 3 800 V) that was optimized daily. All data were obtained in the positive-ion detection mode. In the QO region, 
the instrument parameters were a declustering potential (DP) of 65 V and a focusing potential (FP) of 265 V. 
Nitrogen was used as the collision gas at a setting of CAD5 for both TOF-MS and MS/MS scans. 

All LC-MS/MS data were acquired in information-dependent acquisition (IDA) mode. A TOF-MS 
survey scan with a mass range of m/z = 400 - 1500 and 1 s scan time was followed by two product ion scans with 
10 a mass range of m/z = 70 - 2000 and 2 s scan time. The collision energy (CE) was automatically controlled by 
the IDA CE Parameters script. The switching criteria were set to ions greater than m/z = 400 and smaller than 
m/z = 1500 with a charge state of 2 to 5 and an intensity of > 10 counts/s. Former target ions were excluded for 
60 s and peaks within a 4 Th window were ignored. In addition, the IDA Extensions n script was set to 2 
repetitions before dynamic exclusion and to select a precursor ion nearest to a threshold of 15 count^s every 4 
15 cycles. 

RESULTS AND DISCUSSION 

Figure 2 shows examples of the histologic appearance of proliferative (Figure 2A) and secretory (Figure 
2B) endometrium. In both endometria the stratum basalis is characterized by a denser stroma than the 
physiologic responsive stratum functionalis above. Across the top of the stratum functionalis is the surface 
20 epithehum, which lines the endometrial cavity. The prohferative endometrium (PRO 2) shows small, coiled 
glands with lining columnar epithelium reaching to the surface. In contrast, the secretory endometrium (SEC2) is 
thicker, and contains more tortuous glands with intra-luminal secretions. The endometrium of both types of 
physiologic phases has abundant supportive stroma and vessels among the epithelial glands. In this study 
homogenates from the mirror faces of these tissues were used for quantitative analysis. After performing the 
25 procedure described above, SCX fractions 1 1 - 20 (of the 30 fractions) from each of the samples were chosen for 
further processing. This choice was based on the UV trace generated during fractionation. The 10 chosen 
fractions from each sample were affinity-purified, cleaved as per the ICAT protocol and analyzed using nanoLC- 
MS/MS. Figure 3 is an example of a nano LC-MS total ion chromatogram (TIC) from one of the ICAT Sample A 
SCX fractions. As the samples were run in IDA mode (See above), each such TIC resulted in hundreds of 
30 MS/MS spectra. A ProICAT confidence value of 75 was adopted after trial-and-error for reliable protein 
detection and identification for this initial investigation. This has resulted in identification of approximately 400 
distinct proteins. A preliminary classification of these proteins based on function, is seen in Figure 4. As 
expected the majority of proteins fall under one of the metabolic, housekeeping or structural categories. The 
proteins classified under "other" are proteins like antibodies, which could not be included with the previously 
35 mentioned categories. There were also a significant number of proteins for which function could not be assigned 
or were identified from cDNA matches and are therefore classified as hypothetical. This last category often 
contains the most interesting cases for biologists. Adopting a less stringent confidence value resulted in apparent 
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identification of many more proteins; the reliability of such identifications, however, was judged much poorer 
after manual inspection. In addition, manual inspection of the spectra was also found necessary to confirm 
automated quantifications in ProICAT. 

The results show that expression levels of the majority of proteins identified were not consistently 
5 different between the two phases of the endometrial cycle. This similarity in expression levels of proteins is not 
surprising since most of the abxmdant proteins detected tend to be housekeeping or structural in nature. In 
addition, similar or identical, housekeeping or structural proteins such as actin and tubulin are e3q)ressed in many 
tissue types. This is significant since whole tissue homogenates consist not only of epithelial cells, but also 
supportive stromal cells and interstitium, blood, vessels and any glandular secretions. These sources provide a 

1 0 major contribution to the protein profile of the overall tissue. These high abundance proteins from these adjacent 
cellular types may mask differential protein expression by a cellular component of the endometrium. Thirdly the 
differential expression of low abundance proteins in any one cellular type within proliferative and secretory 
endometrium may not have been detectable by the methodology used in this study. 

Despite these limitations, some instances of differential protein expression was noted. One such 

15 example is given in Figure 5, which shows very significant enhancement of expression of a protein, identified as 
glutamate receptor subunit zeta 1 precursor [SEQ ID NO. 26], in all three secretory samples- The triply charged 
series of peaks, starting at 581.6 Th, was MS/MS-analyzed and identified as the heavy-labeled version of the 
tryptic peptide LLTLALLFSCS VAR [SEQ ID NO. 28], which maps to the N-terminal region of the protein. The 
absence of the light-labeled analogue of this protein, which would have manifested as a series of peaks starting at 

20 578.6 Th, suggests a significantly lower level of the same protein in the proliferative samples. This is the first 
evidence of this protein being a marker for the endometrial secretory phase. 

A second example is shown in Figure 6 for the protein, macrophage migration inhibitory factor (MIF). 
Again, the protein's expression is enhanced in the secretory endometrium (1.71 ± 0.38 times) versus the 
prohferative endometrium. This relative quantification is based on the three sets of samples and is calculated 

25 from the ratios of the total area of the ion peaks within the heavy-labeled series to that of the corresponding light- 
labeled series. Previous studies have demonstrated that MIF is expressed by the human endometrium throughout 
the menstrual cycle and that this expression is predominantly in the glandular epithelial cells [9]. It was found 
that MIF localized throughout the glandular epithelial cytoplasm in the proliferative phase, but that this 
distribution changed during the secretory phase, when it localized to the apical portion of the glandular epithelial 

30 cells, and was also detected in glandular secretions. Macrophages are common in female reproductive tissues. In 
the endometrium, they play an important role in defense. Macrophage degradation of cellular debris and foreign 
material may play an important role in endometrial shedding and repair. Quantification of MIF levels using 
ELISA assays suggested a slight increase in the mean concentration in the secretory phase (from approximately 
15 to 1 8 ng / mg of protein based on a total of 25 samples), although the increase was not statistically significant 

35 [9] . By contrast, ICAT analysis of the six-sample set shows a significant enhancement (1 .71 ± 0.38 times) of MIF 
expression in the secretory endometrium. The results of this study serve to illustrate the power of the ICAT 
method for detecting and quantifying gross as well as subtle differences in expression levels over traditional 
quantitative methods relied on thus far. 
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CONCLUSIONS 

The ICAT technology employing the new, cleavable ICAT reagent is a powerful tool that can be used 
for discovering differentially expressed proteins, which could potentially be significant biomarkers of different 
histological cell states. The results demonstrate that for the human endometrium the expression levels of tlie 
5 majority of proteins do not vary significantly between the proliferative and the secretory phases. Two examples 
of differentially expressed proteins have been discussed in this study. Further investigation of the remainder of 
the sex fi-actions fi-om each of the three pairs of samples can be expected to yield more markers some of which 
might have implications for fertility/ infertility. 
Example 2 

10 The ICAT analysis as described in Example 1 was used to examine differentially expressed proteins in 

cancer and normal endometrial tissue. The results of the anaylsis are illustrated in Figure 1 1 which shows mass 
spectral windows from the ICAT experiments for three pairs of endometrial cancer/ normal samples, 
demonstrating the over expression of calgizzarin in the cancer samples. 
Example 3 

15 Material and Methods: 

i) Tissue preparation and histologic classification 

Endometrium and endometrial cancer tissues were retrieved from a dedicated, research in-house 
endometrial tissue bank. The consenting and tissue banking procedures for this tissue bank were approved by the 
relevant institutions. All tissues had been snap frozen in liquid nitrogen within 15-20 minutes of devitalization at 

20 the time of hysterectomy, and were obtained with patient consent. In each case, the endometrium was classified 
as non-malignant, or malignant by a pathologist. (26). Non-malignant endometrial cases included both normal 
physiologic states (atrophic, proliferative, secretory, menstrual) and pathologic states (benign endometrial polyp, 
disordered proliferative). Malignant endometrial cases included endometrioid, mucinous, and serous 
adenocarcinomas and malignant mixed Mullerian tumors (carcinosarcomas). This classification was performed 

25 using the routine surgical pathology sections. Study cases included only benign or malignant cases; cases of 
endometrial hyperplasia, some of which could be considered to represent an intermediate phenotype, were not 
included in this study. The histologic classification was verified by examination of a histopathologic section 
from the frozen research tissue. Tissue was taken for proteomic analysis from the mirror-face of the residual 
block. 

30 Tissue was thawed in Hanks' balanced salt solution (HBSS, Sigma) containing protease inhibitors 

(leupeptine, aprotinin, pepstatin in 1 \xg/mL) and followed by mechanical homogenation. The specimens were 
then stored in aliquots at -SO^C and/or submitted for protein profiling. These whole tissue homogenates contain 
both endometrial epithelium or carcinoma, supportive stroma and vessels, and any secretions. 

Immunohistochemical staining of selected malignant endometrial tissue was done using a polyclonal 

35 (rabbit) antibody against the putative tumor marker available from Calbiochem (San Diego, CA). Sections were 
cut from the paraffin embedded tissue, antibody applied in a 1:2000 dilution in Universal Strepavidin System, 
and immunohistochemical completed using a diaminobenzidine (DAB) chromogen. 

ii) Protein profiling 
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Tissue lysate was fractionated to reduce the sample complexity before protein profiling. An identical 
quantity of proteins was used for all samples within a method; HBSS was used to compensate the initial volume 
to ensure equal volimies for all samples. For CIS Zip-tip (Millipore) fractionation, 2 jig of proteins from 
endometrial tissue homogenate in the presence of 0,3% trifluroacetic acid (TFA) were loaded. After washing 
5 with water containing 0.3% TFA, 1 (xL of 60% acetonitrile with 0.3% TFA were used to elute proteins from C18 
directly onto a MALDI target containing pre-dried 1 |xL 10 mg/mL sinapinic acid in 60% acetonitrile. The dried 
protein spots were analyzed by a MALDI-TOF (Voyager DE-STR, Applied Biosystems) mass spectrometer. 

For protein profiling using SELDI-TOF MS, 1 jig proteins from endometrial tissue homogenate were 
incubated with WCX2, SAX2, IMAC, H50 surfaces according to the manufacturer's instructions. In brief, 

10 samples were diluted to 55 with the corresponding binding buffer, spotted onto the appropriate ProteinChip 
surface, and incubated in a sealed BioProcessor for one hour at room temperature. The ProteinChip surface was 
washed twice with the appropriate buffer for five-minutes, briefly rinsed with water and air-dried. Two times 0.5 
\iL of 50% saturated sinapinic acid in 50% acetonitrile was applied on the samples to form crystals. The 
ProteinChips were analyzed using a linear TOF analyzer, PBSIIc (Protein Biology System lie, Ciphergen), or a 

15 quadrupole/TOF hybrid tandem mass spectrometer, QSTAR XL (Applied Biosystems/MDS Sciex). 
iii) Protein purification and identification 

A normal and an EmCa sample were subject to chromatographic separation in parallel to yield partially 
purified protein for identification. 500 \ig proteins from the whole tissue homogenate were fractionated using 
size exclusion (BioSep 2000, Phenomenex) at ImL/min flow with phosphate buffer (pH7.9) and 0.05% (w/v) 

20 sodium azide. Qne-millilitre fractions were collected; the eluates were then concentrated to 50 \iL with a silicon 
carbide-based spincolumn, (ProteoSpin, MDS Sciex). Five microlitres (10%) of concentrate was desalted by 
CI 8 zip-tip and analyzed with MALDI-TOF MS to locate the fractions containing the protein marker of interest 
(10,843 Da) in the EmCa sample. The fractions with the enriched 10,843 Da protein were diluted to 100 |aL with 
freshly prepared dithiothreitol (DTT) (5 mM final) in 150 mM Tris pH 8.5 buffer, and incubated at 60 Cfor one 

25 hour. Ten microlitres (10%) of the reaction mixture was desalted by CIS zip-tip and analyzed with MALDI- 
TOF MS to assess the effect of DTT on the protein of interest (see Result for detail). The remaining 90 p,L was 
precipitated by acetone (80%(v/v) final), resuspended in SDS sample buffer, and the proteins were resolved by 
sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE). Protein molecular weight markers 
(New England Biolabs) and cytochrome C (C-2506 Sigma) were included to guide the excision of gel portions 

30 containing the protein of interest. Intact proteins were extracted from the gel by 50 |liL extraction solution 
(formic acid/acetonitrile/isopropanol/water in a ratio of 50/25/15/10) at room temperature for four hours. The 
extracts were completely dried by SpeedVac and resuspended in 40 [iL 100 mM ammonium bicarbonate. Half of 
the resuspended proteins was. desalted by CI 8 zip-tip and analyzed with MALDI-TOF, the other half was 
digested in solution with 100 ng trypsin (Promega). The resulted tryptic peptides were analyzed by MALDI- 

35 QqTOF MS. The identity of the 10,843Da protein, chaperonin 10, was determined using amino acid sequence- 
tag analysis (Mascot, Matrix Science). 

Identification of chaperonin 1 0 was verified by western blot, 1 2 |xg of proteins was resolved by 8- 1 6% 
gradient SDS-PAGE and electrophoretically transferred to nitrocellulose membrane (MSI), Membranes were 



wo 2005/061725 



PCT/CA2004/002172 



-58- 

incubated for two hours with 1:1000 dilution of anti-chaperonin 10 antiserum (Stressgen) or ERKl antibodies 
(Santa Cruz) diluted in 5% milk with 0.2% NP-40 and one hour with secondary antibodies. Chaperonin 10 and 
ERKl were detected by chemiluminescence reagent (NEN) and X-ray film according to the manufacturer's 
instructions. 
5 Results: 

A total of 44 malignant and non-malignant endometrial tissue samples were submitted for proteomic 
analysis. Twenty-three of these cases were non-malignant, and the remaining 21 cases were malignant. The exact 
histopathologic diagnoses are shown in Tables 2 and 3. 
Protein profiling 

10 Both CIS zip-tip purification and retentive separation on ProteinChip WCX2 were effective in 

generating protein profiles that permit differentiation between normal and tumorous epithelium samples. 
Distinguishing features include both appearances and disappearances of proteins. One protein at approximately 
10,840 + 1 0 Da was present in all EmCa samples and absent or greatly diminished in all normal samples tested. 
Figure 7 shows the mass spectra obtamed on the ProteinChip WCX2 using (a) the linear TOF mass spectrometer, 

15 PBSIIc, and (b) the QqTOF mass spectrometer, QSTAR XL. The superior mass accuracy and resolution of the 
latter afforded determination of the molecular weight of the marker protein as 10,843 Da. 
Mariner protein purification and identification 

As detailed above and outlined in Figure 8 size-exclusion LC was employed to fractionate the tissue 
homogeneates. The target 10,843 Da protein was found to elute in one of the early fi-actions firom size exclusion 

20 colunm which suggests that this protein is a part of a large protein complex (Figure 8 A). After concentration on 
the spin column, the proteins were treated with DTT to break the tertiary structure and reduce any disulfide 
bonds. The molecular weight of the target protein was verified to remain as 10,843 Da after DTT treatment and 
before SDS-PAGE (Figure 8B). This result suggests thatthe 10,843 Da target protein contains no intra- or inter- 
polypeptide disulfide bonds. The remaining protein concentrates were further separated by SDS-PAGE, and the 

25 gel was stained with colloidal Coomassie Blue (Figure 8C). There were, however, no visible bands at the region 
around 10-11 kDa, probably a consequence of low protein concentration and relatively low sensitivity of the 
Coomassie stain. The gel portions covering approximate 7,000-16,000 Da were excised as guided by the 
molecular weight markers (Figure 8C) . After protein extraction, digestion with trypsin and MALDI analysis, six 
"unique" tryptic peptides were detected in the EmCa sample vs. the control (Figure 8D). All six peptides were 

30 sequenced by MALDI-QqTOF MS; three were traced to keratin and three to chaperonin 1 0 (Figure 8D and 8E). 
The average molecular weight of chaperonin 1 0 was calculated to be 1 0842.5 Da after considering two putative 
posttranslational modifications, the removal of the N-terminal metliionine and acetylation of the alanine residue. 
This is consistent with the measurement of the target protein and the reported molecular weight of chaperonin 1 0 
purified firom human platelet (IS). 

35 The differential expression of chaperonin 10 among cancerous and normal tissue was re-tested by 

western blot analysis. The signal of chaperonin 10 is higher in all EmCa specimens that also display a relative 
high 10,843 Da peak in their corresponding protein profiles (Figure 9). 

Tables 2 and 3 summarizes the results in identifying chaperonin 10 by MS and western blot analyses in 
non-malignant and malignant endometrial tissue respectively. The results for both MS and western blotting have 
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been reported using a semi-quantitative system ranging from absent (0) to 5+ (for high intensity). These two 
independent methodologies demonstrate consistency in the detection of chaperonin 10. 

Moderately strong immunohistochemical staining for chaperonin 10 was noted in the cytoplasm of an 
endometrioid adenocarcinoma, as compared to adjacent stroma and benign endometrial gland. (Figure 10). This 
5 result demonstrates the association of chaperonin 10 with malignant endometrial tissues, but much less with 
normal endometrial epithelium or supportive stroma and vessels, which were also present in the whole tissue 
homogeneates. 
Discussion: 

Genomic studies of endometrial carcinoma have revealed that there are two, and possibly three, types of 
1 0 endometrial carcinoma that are each characterized by multistep pathogenetic pathways characterized by different 
molecular profiles [28]. A wide variety of genetic and enzymatic markers characterize the initiation, promotion 
and progression toward each type of endometrial carcinoma. Large-scale messenger RNA expression analysis of 
the endometrioid type of endometrial carcinoma has identified 50 genes that are capable of discriminating normal 
from malignant endometrial tissues [29]. Many genes that are constitutively expressed in normal endometrium 
15 show either diminished or increased expression in endometrial carcinomas. In addition, there is aberrant 
expression of the one hundred hormonally regulated genes that are variably expressed in normal endometrial 
tissues, although finally endometrial carcinoma resembles proliferative endometrium more than secretory 
endometrium [29]. 

In contrast to the genomic studies of the endometrium, no studies of the endometrial proteome are 

20 available, even though proteome analysis may offer information about protein expression, functions, and 
modifications which might not be fully reflected by gene expression analysis. Thus, protein expression profiling 
of the endometrium offers a new opportunity to identify and classify endometrial phenotypes, including 
carcinoma. Among the available methodologies for protein profiling, SELDI-TOF based method has the 
advantage of requiring only miniscule samples and high tliroughput capability, although the determination of any 

25 protein identity requires much more work [30]. 

This proteomic study of endometrium used lysates of whole tissue homogenates from both control 
endometrial tissues and endometrial carcinomas. Such tissues include not only the epithelial cells of interest, but 
also supportive stroma (including both endometrial stromal cells or fibroblasts and extracellular matrix), blood 
vessels (including smooth muscle cells and endothelium), any secretions, and possibly small amounts of adjacent 

30 myometrium. While the use of such whole tissue homogenates is technically straight forward, it does have 
inherent limitations. The heterogeneous nature of the constituent tissue will lead to a similarly heterogeneous 
protein profiling [17], The resultant proteomic analysis reflects not only the cells of interest, but also the presence 
of contaminating cells [15, 17, 3 1]. Successful proteomic analysis of some types of tumors (e.g. hepatoma) may 
still be productive even with the limitation of whole tissue homogenization, since there is an abundance of tumor 

35 cells and minimal associated contaminating stroma [32]. 

In some tumor types, the inherent limitation of whole tissue homogenates must be surmounted by 
utilizing cell purification techniques. In the endometrium LCM may be optimal in achieving cellular purification 
for proteomic analysis since there is a relative abundance of stroma in endometrial carcinoma tissues [32], 
Studies using laser capture microdissection (LCM) of melanoma have clearly revealed a different protein profile 
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for melanoma than that of the surrounding epitheHum, and thus supports the use of LCM in proteomio profiling 
studies [21, 33-35]. 

Despite the limitations of using whole tissue homogenates for protein expression profiling, this 
preliminary study of endometrial carcinoma has shown that proteomic analysis of endometrial carcinoma can 
5 detect differences from that of normal endometrium. Furthermore, a specific protein (Chaperonin 10) was 
strongly associated with endometrial carcinoma cases. This potential marker might be clinically useful on tissue 
aspirates since its differential expression pattern can be detected without the LCM procedure. 

Chaperonin 10 was not identified exclusively in malignant endometrial tissues; low levels were detected 
in non-malignant endometrial tissues by both mass spectrometry and western blotting techniques (Table 2). 

10 Furthermore, there is no apparent association between any particular histopathologic classification and the 
detection of these low levels of chaperonin 10. In contrast, high levels of chaperonin 10 were detected in 17 of 22 
malignant endometrial tissues by either mass spectrometry and/or western blotting techniques (Table 3). The 
apparent absence of chaperonin 10 in the remaining five of the 22 malignant cases may be due to either true 
absence or technical factors in pre-analytic processing or proteomic analysis. In two of these five cases, re- 

1 5 examination of the corresponding mirror image histologic section revealed minimal tumor in one case (case 28), 
or abundant necrosis of tumor (case 44). Furthermore, specific protein peaks of interest may be obscured in less- 
than- optimal mass spectromectric analysis or by adjacent protein peaks. 

Chaperonin 10 (Cpn 10) is a heat shock protein (HSP) that functions intra-cellularly as a molecular 
chaperone for nascent proteins [36, 37]. HSP' s are ubiquitous intra-cellular proteins that ensure homeostasis of 

20 metabolism [37]. Aberrations of HSP function, including chaperonin 10, have been described in a variety of 
pathologic conditions, including neoplasia [37, 38]. Furthermore, HSP's are differentially expressed in a variety 
of neoplasms. For example, immunohistochemical studies of both primary and secondary brain tumours have 
shown production of other HSP's [39], and that the expression of some HSP's may depend upon proliferating 
potential . Furthermore, the modulation of HSP expression profile has been shown to reflect the stage of prostatic 

25 carcinoma [37]. 

Immunohistochemical studies have identified HSP's within the nuclei and cytoplasm of epithelium, 
stroma, endothelium, -and lymphocytes of the endometrium, with certain types of HSP's showing preferential 
localization to certain cell types [40]. Whereas the expression of some HSP's occurs independent of the stage of 
the endometrial cycle (e.g. HSP90), the expression of other HSP's is cycle dependent. The expression of 

30 chaperonin 10 can be determined among proliferative, secretory, and menstrual endometria using proteomic, 
western blotting, and immunohistochemical methods. It is known that the amount of HSP27 and 60 is increased 
during late proliferative and early secretory phases, and subsequently reduced during mid- and late secretory and 
menstrual phases [40]. Studies of both decidualized endometrium (decidua) and placenta have shown that there 
are striking differences in the cellular localization of HSP's during normal human gestation. 

35 There have been no studies regarding the role of HSP 's or CpnIO in endometrial carcinoma. The eutopic 

endometrial glands from women with endometriosis and adenomyosis shows significantly increased expression 
of HSP's as compared to endometria fi:om control women — regardless pf the menstrual phase [41]. The 
abnormal expression of HSP's may play a role in the pathophysiology of both endometriosis and adenomyosis. 
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Early pregnancy factor (EPF) is an extra-cellular homologue of Cpn 1 0 that appears within 24 hours of 

I 

fertilization and persists throughout the first half of gestation [36]- It is necessary for embryonic development 
[42-44] and is immunosuppressive [45] . EPF is also detectable in animal models of liver regeneration and in the 
development of cancer [38, 46, 47]. An association between cellular growth and the appearance of extracellular 
5 EPF has been shown [38, 46, 47]. These findings suggest a role for EPF in neoplastic growth and that the 
detection of Cpn 10 in the serum of endometrial carcinoma has diagnostic potential. 

In conclusion, this protein profiling study of non-malignant and malignant endometrial tissues has 
identified chaperonin 10 as a tumor marker for endometrial malignancies. 
Example 4 

1 0 Tlie work described in the previous example using solid-phase extraction followed by matrix-assisted 

laser desorption/ionization mass spectrometry (MALDI MS) as well as selective surface binding and surface- 
enhanced laser desorption/ionization (SELDI MS, Ciphergen Biosystems Inc, CA, USA) has succeeded in 
identifying individual markers that show significant enhanced expression in EmCa tissues. The MALDI/SELDI 
MS strategy relies on side-by-side comparison of spectra fi-om proteins that have been selected via fast separation 

15 [120-124]. Using this methodology the identities of the proteins are unknown and identifying a given protein 
marker typically involves offline multidimensional chromatography, concentration, trypsin digestion and 
MS/MS. A second available strategy that highlights differentially expressed proteins involves differential 
tagging of proteins from samples that are being compared using isotope-coded affinity tag (ICAT) in an isotope- 
dilution mass spectrometry experiment [6]. This strategy has recently been applied to discover differentially 

20 expressed proteins between the proliferative and secretory phases of the human endometrium [49] using a 
cleavable, second- generation ICAT reagent (cICAT, Applied Biosystems Inc, CA, USA) (see above). Recently, 
a variation of the ICAT technology, iTRAQ (also from the Applied Biosystems Inc, CA, USA), has been 
introduced. Both cICAT and iTRAQ tagging permit online identification of multiple markers and relative 
quantification of these proteins. Although similar in their basic concepts, the two tagging reagents and 

25 methodologies differ in significant areas. The cICAT method relies on tagging cysteine residues and isolating 
peptides containing these tagged residues by affinity chromatography. The net result is a reduction in the 
complexity of peptide pools generated by digestion with proteases including trypsin [6]. In tlie case of the new 
iTRAQ method, tagging is on primary amines. This difference in labeling strategy eliminates the dependence on 
relatively nonabundant cysteine containing peptides intrinsic to ICAT-based methods, thus potentially allowing 

30 the tagging of most tryptic peptides. Other noteworthy features of the iTRAQ technology are that relative 
quantification is performed via MS/MS and that there are four possible tags, which permit multiplexing of up to 
four samples (tissue states) in a single experiment. Quantification is performed via the differences in abundances 
of four product ions, 1 14, 1 1 5, 1 16 and 1 17 Th that are each cleaved from one of the four possible tags. The tags 
have an identical mass, a result of differences in other parts of the iTRAQ tag structure, with the consequence 

35 that an identical peptide in the four samples will have an identical mass and LC retention time after tagging. This 
strategy simplifies analysis and will potentially increase analytical accuracy and precision. The multi- sample 
capability of the iTRAQ technology is ideally suited for this study, as it now provides a means to perform a 
proteomic analysis of both the major phases of the normal endometrium, while simultaneously comparing them 
against cancer samples. 
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This example describes a feasibility study that compares protein expression profiles between normal and 
cancerous endometria using iTRAQ as well as contrasts it against a similar feasibility study using the cICAT 
technology. Both iTRAQ and cICAT labeling afforded discovery of a number of differentially expressed 
proteins that are potential cancer markers (PCMs). There is little overlap in the PCMs discovered and identified, 
5 thus pointing to the complementary nature of the two technologies. It is noteworthy that application of the 
iTRAQ methodology permits confirmation of the overexpression of chaperonin 10 in EmCa tissues [50]; 
chaperonin 10 is an approximately 10 kDa heat shock protein that does not contain the cysteine residue, which is 
the tagging site for cICAT. Thus, in addition to an interest in the discovery of markers for endometrial 
carcinoma, these experiments have also helped to illustrate the relative strengths and differences of the two 
10 tagging techniques, when applied to studies of clinical samples. 
Materials and Methods 

Sample Preparation, Endometrial tissue was retrieved fi-om an in-house dedicated, research endometrial tissue 
bank. All tissues were snap fi-ozen in liquid nitrogen within 15-20 minutes of devitalization at the time of 
hysterectomy, and were obtained with patient consent. The patient consent forms and tissue-banking procedures 

15 were approved by the Research Ethics Boards of York University, Mount Sinai Hospital, University Health 
Network, and North York General Hospital. In each case, the endometrium was classified by a pathologist 
(TJC). Histological classification was verified by examination of a section jfrom the fi-ozen research tissue. 
Tissue for proteomic analysis was taken fi-om the mirror-face of the residual block. In the case of samples used 
for iTRAQ analysis, 0.5 ml phosphate buffered saline (PBS) containing protease inliibitors (ImM AEBSF, 10 

20 fiM leupeptin, 1 ^ig/ml aprotinin and 1 )xM pepstatin) was added. The tissue was then mechanically 
homogenized at 30,000 rpm using aPol3^on PT 1300D handheld homogenizer (Brinkmann, Westbury, USA). 
For samples used for cICAT analysis, tissues were similarly homogenized in 1 ml Hanks' Balanced Salt Solution 
with the same concentration of protease inhibitors as listed above. The samples were stored in ajiquots at -70°C 
until used for further processing. These whole tissue homogenates contained not only endometrial epithelium, 

25 but supportive stroma and vessels, as well as secretions. The iTRAQ analysis involved one normal proliferative, 
one normal secretory, and two cancer homogenates, while the cICAT analysis combined one normal proliferative 
homogenate with three different cancer homogenates in pair-wise comparisons. 

Chemicals. Reagent grade chemicals were purchased firom Sigma Aldrich (Oakville, ON, Canada), or Fisher 
Scientific (Nepean, ON, Canada). All iTRAQ and cICAT reagents and buffers were obtained fi-om Apphed 

30 Biosystems (Foster City, CA, USA). 

iTRAQ Sample Preparation Procedure, Cell debris fiom each of the homogenates was removed by 
centrifugation in a microfuge at 4°C for 30 min at 14,000 rpm. The clarified supernatant was then transferred to 
fresh microfuge tubes and the total protein content determined using a commercial Bradford assay reagent (Bio- 
Rad, Mississauga, ON, Canada). A standard curve for the Bradford assay was made using y-globulin as a 

35 control. 100 jj^g of each sample was then denatured and the cysteines blocked as described in the iTRAQ 
protocol (Applied Biosystems, Foster City, CA, USA). Each sample was then digested with 0,2 mL of a 50 
pg/mL trypsin (Promega) solution at 37 °C overnight and labeled with the iTRAQ tags as follows: normal 
proHferative endometrium, iTRAQ 1 14; normal secretory endometrium, iTRAQ 1 15; and the two EmCa samples. 
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iTRAQ 116 and iTRAQ 117. The labeled samples were then pooled and acidified by mixing with Eluent A (see 
later) to a total volume of 2.0 mL for strong cation exchange (SCX) chromatography. 1.5 mL of this acidified 
labeled sample was injected into an HP1050 LC system (Agilent, Palo Alto, CA, USA) with a 1.5 mL injection 
loop and a 2. 1 mm internal diameter (ID) x 100 mm length PolyLC Polysulfoethyl A column packed with 5 pun 
5 beads with 300 A pores (The Nest Group, Southborough, MA, USA). A 2.1 mm ID x 10 mm length guard 
column of the same material was plumbed upstream from the analytical colunrn. Fractionation was effected by a 
binary mobile-phase gradient at a total flow rate of 0.2 mL/min. Eluent A consisted of a 10 mM KH2PO4 
solution in 25% acetonitrile and 75% deionized water acidified to a pH of 3.0 with phosphoric acid. Eluent B 
consisted of a 10 mM KH2PO4 and 350 mM KCl solution in 25% acetonitrile and 75% deionized water acidified 

10 to a pH of 3.0 with phosphoric acid. Initially, the gradient comprised 100% Eluent A. At the 2"^ minute, the % 
Eluent B was changed linearly from 0 to 100% atthe 58*** min. The run was terminated at the 60* min. A total 
of 30 fractions were collected for the sample, one every two minutes, *using an SF-2120 Super Fraction Collector 
(Advantec MFS, Dubhn, CA, USA). Following fractionation, the samples were dried by speed vacuuming and 
stored at -20 **C. Prior to reverse phase nanobore liquid chromatography-tandem mass spectrometric (nanoLC 

15 MS/MS) analysis, these fractions were redissolved in an aqueous buffer containing 5% acetonitrile and 0. 1% 
formic acid for nanoLC MS/MS. . 

cICAT Sample Preparation Procedure. Samples were clarified and their total protein content determined as 
described above. cICAT sample preparation procedure was carried out according to the cleavable ICAT protocol 
(Applied Biosystems, Foster City, CA, USA). Tliree 100 \ig aliquots of the clarified normal proliferative 

20 homogenate were paired with 1 00 \ig each of the three clarified cancer homogenates separately. After denaturing 
and reduction, the normal homogenates were labeled with the light reagent while the EmCa homogenates were 
labeled with the heavy reagent. A light labeled sample was then mixed with one of the heavy labeled samples to 
form in total three ICAT sample pairs: A, B and C. The final volume of each sample was 0.2 mL. The sample 
pairs were then digested by incubating each pair with 0.2 mL of a 50 fig/mL trypsin solution at 37"C overnight. 

25 Afterwards each sample pair was mixed with 2.0 mL of Eluent A and fractionated into 30 fractions using SCX 
chromatography as described above. The fractions were screened by MALDI MS analysis. Those showing the 
signature ICAT peak pairs separated by 9 Da were further processed by affinity purification using the avidin 
cartridge provided witli the ICAT kit. The affinity-purified sample was then dried, treated with the cleavage 
reagent to eliminate biotin, and dried again by speed vacuuming. The resulting solids were re-dissolved in an 

30 aqueous buffer containing 5% acetonitrile and 0. 1% formic acid for nanoLC MS/MS. 

Nanobore LC/MS/MS. The nanobore LC system was from LC Packings (Amsterdam, The Netherlands) and 
consisted of a Famos autosampler and an Ultimate Nano LC system. It was interfaced to an API QSTAR Pulsar 
QqTOF mass spectrometer (Applied Biosystems/MDS Sciex, Foster City, CA, USA) using a Protana NanoES 
ion source (Protana Engineering A/S, Odense, Denmark). The spray capillary was a PicoTip SilicaTip emitter 

35 with a 1 0 jam ID tip (New Objective, Woburn, MA, USA). The nanobore LC column was a 75 jim ID x 150 mm 
length reverse-phase PepMap CI 8 nano capillary column (LC Packings, Amsterdam, The Netherlands) packed 
with 3 jLim beads with 100 A pores. One jiL of sample was injected via the full-loop mode. Separation was 
performed using a binary mobile-phase gradient at a total flow rate of 200 nL/min. Eluent A consisted of 94.9% 
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deionized water, 5.0% acetonitrile and 0. 1% formic acid (pH « 3). Eluent B consisted of 5.0% deionized water, 
94.9% acetonitrile and 0.1% formic acid. 

The following binary gradient was used for the iTRAQ experiments: 

Time(min) 0 5 10 125 135 155 160 162 188 
% Eluent B 5 5 15 35 60 80 80 5 Stop 

While the binary gradient used for cICAT experiments was: 



Time(min) 0 5 125 135 157 160 163 190 
% Eluent B 5 5 30 60 80 80 5 Stop 



For nanospray, the source conditions were a curtain-gas setting of 20 and an ionspray voltage in the 

10 range of 1800 - 3000 V that was optimized daily. In the QO region, the instrument parameters were a 
declustering potential (DP) of 65 V and a focusing potential (FP) of 265 V. Nitrogen was used as the collision 
gas at a setting of CAD = 5 for both TOF-MS and MS/MS scans. All nanoLC IVIS/NIS data were acquired in 
information-dependent acquisition (IDA) mode in Analyst QS SP8 with Bioanalyst Extension 1 , 1 (Applied 
Biosystems / MDS Sciex). Two sets of runs for the iTRAQ fractions were performed. For the first set, MS 

15 cycles comprised a TOF MS survey scan witli an m/z range of 400 - 1500 Th for 2 s, followed by tliree product 
ion scans with an m/z range of 70 — 2000 Th for 10 s each. For the second, a 1 s TOF MS survey scan, followed 
by three product ion scans of 3 s each were used. For cICAT experiments, the MS cycles consisted of a TOF MS 
survey scan for 1 s, followed by two product ion scans of 2 s each. The ranges for the TOF MS and product ion 
scans were the same as those of the iTRAQ experiments. Collision energy (CE) was automatically controlled by 

20 the IDA CE Parameters script. Switching criteria were set to ions greater than m/z = 400 Th and smaller than 
m/z ~ 1500 Th with a charge state of 2 to 5 and an abundance of > 10 counts/s. Former target ions were 
excluded for 60 s and ions within a 4 Th window were ignored. In addition, for iTRAQ runs, the IDA 
Extensions II script was set to no repetitions before dynamic exclusion for the first run, and one repetition for the 
second. While for the ICAT experiments, the IDA Extensions II script was set to two repetitions before dynamic 

25 exclusion. In all three experiments, the script was set to select a precursor ion nearest to a threshold of 1 5 count/s 
every 4 cycles. These settings ensured examination of not only high abundance ions, but low abundance ones as 
well. 

Data Analysis, Data analysis for the iTRAQ experiments was performed with ProQUANT 1.0, while that for 
cICAT experiments was with ProICAT' 1.0 SP3. The cut off for the confidence settings for both analyses was at 
30 75; that for the score was at 20. The tolerance set for peptide identification in ProQUANT searches were 0.15 Da 
for MS and 0. 1 Da for MS/MS, while those for ProICAT searches were, 0.2 Da and 0. 1 Da, respectively. All 
identifications were manually inspected for correctness. Relative quantification of proteins in the case of iTRAQ 
is performed on the MS/MS scans and is the ratio of the areas under the peaks at 1 14 Da, 115 Da, 1 16 Da and 
117 Da which are the masses of the tags that correspond to the iTRAQ reagents. Relative quantification of 
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proteins in the case of cICAT was performed on the TOF-MS scans by calculating the relative areas between the 
pairs of light and heavy label series of peaks. In both cases, the results of the quantification were normalized 
using the overall ratio obtained for all tagged peptide pairs in the sample. On the basis of previous studies 
utilizing cICAT reagents to compare secretory to proliferative endometrium, which involved paired tissue 
5 analysis of samples labeled with heavy and Kght isotopes, it was found that many proteins were expressed within 
a standard deviation (SD) of < 0.3 and an expression ratio close to 1.0, when protein expression was normalized 
against a housekeeping protein (n=6) [49]. Using this SD as an approximate benchmark for variation of protein 
expression among proteins whose expression is not altered when utilizing this teclinique, it was considered that a 
difference of 3 SDs i.e. 1.0 plus 0.9, or about a 2 fold difference, in normalized expression levels would be 

10 outside of the variation expected by chance, with a confidence limit of > 95% using a simple Gaussian 
approximation. This approximation would, therefore, apply to normalized expression levels > 2.0 or in 
reciprocal form < 0.5, as no assumption was made with regard to which tissue showed greater abundance. Using 
this approach, therefore, a two-fold expression change was chosen as an initial benchmark for potentially 
significant changes. In small sample sizes used for these initial studies, i.e. one sample in each group, at an alpha 

15 value of 0.05 (two-sided) and a beta value of 0.2 (80% power), the ratio of expression increase to SD must be at 
least 4 in order to achieve significance. This means that for minimal statistical significance to be achieved using 
the above alpha and beta parameters for a single pau-ed-tissue comparison, a 2-fold mean increase in expression 
must be associated with an SD less than 0.5 for replicate measurements in order to satisfy the sample size 
equation [n = 2 SD (Z^ +Zp) /(expression increase)^] =1, when the power index (Z^ +Zp)^ = 8 (a = 0.05, P 

20 0.2). For tihe sample size of three paired comparisons (one normal tissue and three tumor tissues were used in the 
cICAT experiment), tlie sample-size approximation of n = 2 in each group was used to compensate for the fact 
that one normal tissue was used (i.e. n = 1 in the control group, and n = 3 in the tumor group). The ratio of 
expression increase to SD must be greater than 2.8 in order to satisfy the above equation for an n = 2 (using the 
same a and p error limits of 0.05 and 0.2, respectively). 

25 Results 

iTRAQ runs 1 and 2 led to sequencing and identification of 645 and 1,026 peptides, respectively. The 
previous studies (see above) showed that a confidence setting of 75 and a score setting of 20 were optimal. 
Manual inspection of hundreds MS/MS spectra concluded that while these conditions were sufficiently stringent 
in that most identifications were correct, they also did not exclude too many peptides that could have been 

30 identified based on spectral quality. The MS/MS spectra of all peptides that scored above the cut offs were 
manually inspected to verify proper identifications. The shorter MS and MS/MS cycles used for run 2 resulted in 
a higher number of peptides identified. A significant number of these peptides were identified more than once, 
thus the number of unique peptides dropped to 292 and 312 for runs 1 and 2, respectively. Many of the more 
abundant proteins were identified by several peptides (Figure 12); the numbers of proteins identified were 101 

35 and 1 26, respectively. 63 proteins were identified in both runs. The numbers are modest, which were likely due 
to the small amount of starting materials.. Five proteins have abundance ratios that show more than a two-fold 
change (> 2.0 or < 0.5) in both cancer samples relative to the proliferative as well as the secretory endometrium, 
and meet the criteria of differential expression. These are shown in Table 4. 
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Relative quantification is expressed as three pair-wise ratios against the proliferative endometrium 
(iTRAQ 114) for the secretory endometrium (iTRAQ 115) and the two EmCa samples (iTRAQ 116 and 1 1 7). To 
accoimt for small differences in protein loadings across the samples, these ratios have been normalized using the 
overall ratios for all proteins in the samples, as recommended by Applied Biosystems. The rationale for this 
5 choice is based on the assumption that the relative abundances for the majority of proteins are close to one. This 
assumption is exemplified in this study in terms of the abundance ratios in secretory versus proliferative 
endometrium for the following abundant proteins: cytoplasmic actin, 0.98; alpha enolase, 0.91; alpha filamin, 
1.04; serum albumin precursor, 1.56; and tropomyosin alpha 4 chain, 0.99. Figure 13a shows the MS/MS 
spectrum for the doubly charged cytoplasmic actin peptide at 725.4 Th in one of the runs. The cluster of peaks 

10 around 1 15 Th is better shown in the mass spectral window 110 — 120 Th in Figure 13b, demonstrating near 
identical abundances in the four samples: proliferative (iTRAQlM), secretory (iTRAG115), EmCa 1 
(iTRAQl 16), and EmCa 2 (iTRAQl 17). 

Typical MS/MS windows (1 10 120 Th) for tryptic peptides in four differentially expressed proteins, 
(A) chaperonin 10, (B) alpha- 1 -antitrypsin precursor, (C) creatine kinase B, and (D) transgelin, are shown in 

15 Figure 14. Chaperonin 10 is overexpressed in both EmCa samples, whereas alpha- 1 -antitrypsin precursor, 
creatine kinase B, and transgelin are underexpressed. The remaining differentially expressed protein, pyruvate 
kinase Ml or M2 isozyme, is also overexpressed in the two EmCa saniples. 

The cICAT experiments led to identification and quantification of 68 proteins, all were manually 
verified. Again, the modest number stemmed fi-om the small sample size. Fewer proteins were identified by 

20 multiple peptides relative to iTRAQ (Figure 1 2), in accordance with the more selective nature of cICAT labeling 
on only cysteine residues. Five proteins that meet the two-fold differential expression criterion in ail three EmCa 
versus proliferative endometrium pairs are: calgizzarin, heterogeneous nuclear ribonucleoprotein (hnRNP) DO, 
macrophage migration inhibitory factor (MIF), polymeric immunoglobulin receptor (PIGR) precursor, and 
pyruvate kinase Ml or M2 isozyme. These results are summarized in Table 5. All five proteins are 

25 overexpressed in EmCa tissues; pyruvate kinase is also shown to be overexpressed with iTRAQ. Figure 15 
shows an example of overexpressed protein, calgizzarin A. Again, the relative abundance ratios are normalized 
to tlie overall ratio of all proteins in a given sample pair to account for small differences in protein loadings in tlie 
two samples. Cytoplasmic actin exhibits a heavy/light label ratio of 0.95 ± 0.24 (standard deviation of three 
samples). 

30 Discussion 

The combination of iTRAQ and cICAT labeling results in the discovery and identification of nine 
differentially expressed proteins that are potential cancer makers (PCMs) for EmCa. Six of the nine PCMs are 
overexpressed, whereas three are underexpressed in EmCa. 

PCMs Overexpressed in EmCa. Chaperonin 10 is a heat shock protein that was identified as a PCM using 
35 MALDI/SELDI MS and identified by offline separation, preconcentration, trypsinization and MS/MS (see 
above). Overexpression of chaperonin 10 in EmCa tissues was verified independently by Western analysis; 
chaperonin 10 was localized to the cancerous epithelium by means of immunohistochemistry (IHC). Elevation 
of levels of chaperonin 10 has been associated with large bowel and cervical carcinomas [53]. The level of 
chaperonin 10 in serum was demonstrated to be an indicator of trophoblastic tumor [54]. The observation of 
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chaperonin 10 overexpression in EmCa samples in this study with iTRAQ confirms the previous finding using 
MALDI/SELDI MS, Western analysis and IHC. Thus, chaperonin itf may be a serum marker protein for EmCa. 

Overexpression of pyruvate kinase Ml or M2 isozyme was demonstrated by both the ITRAQ and 
cICAT applications. Pyruvate kinase was found to be expressed at elevated levels in botli plasma and fecal 
5 samples in patients of gastrointestinal cancer [55,56] . The ability to detect pyruvate kinase in plasma samples is 
of particular interest as this might indicate that screening plasma for elevated pyruvate kinase levels in 
endometrial cancer cases may also be possible. An investigation into the activity of 12 enzymes related to the 
glycolytic pathway in cervical and endometrial cancers found that only two, pyruvate kinase and phosphoglucose 
isomerase, were significantly higher in both cancers [57]. 

1 0 Calgizzarin was found to be overexpressed in EmCa tissues using cICAT labeling. Calgizzarin was one 

of two proteins that exhibited significant upregulation in colorectal and lung carcinoma cell lines over normal 
colorectal mucosal cells [58]. Calgizzarin was one of three proteins that have been identified as tumor markers 
in mouse colon cancer [59]. The other two tumor markers were calgranulins A and B, the former of which was 
also identified as a protein marker in human endometrial carcinoma using SELDI MS. Identification was made 

15 possible by size-exclusion chromatography, trypsinization and onlme nanoLC MS/MS; confirmation of 
calgranulin A overexpression in EmCa was rendered by IHC in a tissue microarray format. 

HnRNP DO, also known as AU-rich element RNA-binding protein 1, binds to the AU rich 3' UTR of 
many proto- oncogenes- One study has shown that this protein is more abundant in murine neoplastic lung 
epithelial cell lines and that this abundance decreases when non-tumorigenic cells reach confluence or growth 

20 arrest [60] . Conversely, it was also found that the abundance was unaffected in spontaneous transformants from 
this cell line. Another more recent study using transgenic mice showed that overexpression in an isoform of this 
protein altered mRN A levels of several oncogenes, including c-myc, c-jun, c-fos and TNF-alpha [61]. The mouse 
line with the highest level of this isoform developed sarcomas [61]. 

Macrophage Migratory Inhibitory Factor is another protein that has been well documented as being 

25 involved with cancer [62] . This ranges fi-om hepatocellular carcinomas [63], to non small cell lung cancer [64], to 
brain tumors [65] and gliblastomas [66]. 

PIGR has previously been detected in serum fi:om patients with lung cancer and was shown to be 
significantly upregulated in the secretory component by an ELISA study involving 45 lung cancer patients 
compared with 45 control subjects [68]. Another study has also demonstrated a possible linkage between PIGR 

3 0 and bladder carcinoma with protein levels in serum being significantly increased in patients with transitional cell 
carcinoma [69]. 

PCMs Underexpressed in EmCa. Alpha- 1 -antitrypsin precursor is one of the three proteins that are 
underexpressed in EmCa tissues. Recent studies have shown downregulation of alpha- 1 -antitrypsin to be 
associated with mahgnant lymphoma as well as liver, lung, stomach, bladder and gall bladder cancer [70,71]. 
35 There is also evidence for upregulation contributing to enhanced cell migration and metastases of human colon 
cells in a rat model [72]. Such a contradiction might be explained by the effect of alpha- 1 -antitrypsin being 
tissue specific which would prove useful for distinguishing between forms of cancer. 

Creatine kinase B shown here as being underexpressed in EmCa has likewise been demonstrated as 
being downregulated in cervical, colon, and lung cancers but not in hepatocarcinoma [73,74]. 



wo 2005/061725 



PCT/CA2004/002172 



-68- 

Transgelin is another protein observed to be underexpressed in this study. It has also been implicated as 
being downregulated in breast and colon carcinomas as well as in a lung epithelial cell line [75,76]. Transgelin 
was isolated and identified as ah antigen from renal cell carcinoma; subsequent in-situ hybridization experiments, 
however, found that the malignant cells were negative with respect to transgelin and that the transgelin source 
5 was from the mesenchymal cells of the stroma [77]. Thus, downregulation of transgelin appears to be true for 
renal cell carcinoma as well. 

Proteins Showing Possible Differential Expression, There are four proteins that showed differential expression 
by cICAT, but a smaller than critical (two-fold) change by iTRAQ labeling. One such protein is cyclophilin A, 
which was observed as being overexpressed by approximately four-fold in the ICAT analysis; however, the 

10 iTRAQ experiments showed a smaller overexpression of 1.47 ± 0.29 across both cancer samples in both runs 
compared with the normal proliferative sample, which is not considered as signifrcant according to our criterion 
of a two-fold change. Cyclophilin A has recently been reported as overexpressed in lung cancer [76]. 

Triosephosphate isomerase is another protein showing overexpression by cICAT tagging. However, the 
relative standard deviation for this protein at d= 72% was also large, suggesting a variable expression level. There 

15 is a recent study that suggests triosephosphate isomerase to be upregulated by two-fold in lung cancer [78]. 
Triosephosphate isomerase level was found to be highly variable in renal cell carcinoma [79], iTRAQ labeling 
shows an overexpression of 1.55 db 0.28. 

Superoxide dismutase [Cu-Zn] is another protein that showed a large variation, ± 50%, in the cICAT 
results. This protein has been implicated in pancreatic adenocarcinoma [80]. iTRAQ results at 1.46 ± 0.08 is 

20 below the criterion of overexpression. Phosphatidylethanolamine binding protein showed a cICAT 
overexpression level of 3.7 ± LO and an iTRAQ overexpression level of 1.39 ±0.13. There is evidence for 
upregulation in rat hepatoma cell lines [81]. 

The use of isotope-coded affinity tags and nanoLC/MS/MS afforded examination of a large number of 
proteins for differential expression. This method is a lot more efficient and powerful in terms of the number of 

25 proteins that can be examined in a given experiment than SELDI MS. However, complex data analysis and tlie 
need for manual examination of MS/MS data limit the number of samples that can be examined within a given 
period of time. As a consequence, iTRAQ tagging involved two normal endometrial tissues versus two EmCa 
tissues, whereas cICAT pairing involved only one normal endometrial versus three EmCa tissues. By contrast, 
PCM discovery using SELDI MS involved in excess of 40 samples [see above]. A contribution to the 

3 0 uncertainties of the aforementioned four proteins may simply be individual variations in protein expression, both 
within the normal group as well as the EmCa group. The experience with the results of chaperonin 10 and 
calgranulin A shows that, while there is overexpression in EmCa tissues, there are considerable variations in 
protein abundances, possibly reflecting variability in the cellular subpopulations of the whole tissue homogenates 
or in the type or nature of the EmCa. In addition, there is variation in the abundances of these proteins across 

35 individual normal endometrial samples. 

Selectivity of PCMs. Literature data show that the nine differentially expressed proteins discovered and 
identified here are associated with cancers and are, indeed, potential cancer markers. Individually, these nine 
PCMs are nonspecific for endometrial carcinoma, as each has been linked with cancers other than EmCa. As the 
principal concern in screening, diagnosis, and monitoring of EmCa is the exclusion or omission of any malignant 
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endometrial disease, the fact that these PCMs have been noted in other cancers would rarely be a potential 
clinical drawback. In other words, for envisaged clinical use, the sensitivity is of far more concern than 
specificity for EmCa. It is probable that different cancers will share similar pathways in tumorigenesis, which 
will induce over- and under expression of similar proteins [57]. Nevertheless, it is unhkely that the differential 
5 expression pattern involving many proteins will be identical for all or some primary tumor sites. Their collective 
use as ?L panel may be more effective in indicating the site of origin of a cancer [82,84], despite the association of 
individual markers witli a variety of primary sites. 

The absolute expression level of PCMs will be an important issue. Knowing protein abundances in 
normal and diseased states will allow establisliment of threshold levels beyond which EmCa is signaled. iTRAQ 
10 permits simultaneous investigation of up to four samples, thus facilitating the inclusion of synthetic tryptic 
peptides of known amounts in absolute isotope-dilution experiments 

iTRAQ versus cICAT. The results obtained by iTRAQ and ICAT analyses suggest that the information 
generated by the two methods is complementary. There are, however, a few aspects on which each of these 
methods has advantages over the other. Quantification by cICAT can be compromised by overlapping peaks in 

15 the MS spectrum; this complication is resolved in iTRAQ as quantification is performed on the MS/MS 
spectrum. iTRAQ on the other hand requires processing samples separately until after the tryptic digestion. This 
increases the potential for errors introduced, as a result of sample handling or different extents of tryptic 
digestion. Another aspect on which iTRAQ differs is complexity because of the relatively nonspecific nature of 
labeling. Many more proteins are identified by multiple peptides. This is not necessarily a disadvantage, as it 

20 permits quantification of multiple peptides, thereby increasing the confidence in the ratios report. After 
classifying identified proteins into broad categories (Figure 16), it is apparent that there is a liigher proportion of 
signaling proteins identified by the cICAT method. Conversely, iTRAQ analysis identified a larger percentage of 
the more abundant ribosomal proteins and transcription factors. 
Example 5 

25 Summary 

Proteomic analyses of the proliferative and secretory phases of the human endometrium were carried out 
to identify proteins and discover differentially expressed proteins using isotope-coded affinity tags and three 
stages of chromatographic separation and onUne MS/MS. From an initial list of 346 proteins identified by 
PrblCAT, manual inspection of MS/MS spectra and confirmatory searches pared the list down to 119 positively 
30 identified proteins. Only five of the proteins showed consistent differential expression. The two proteins with 
unquestionable differential expressions in the secretory endometrium are: glutamate NMDA receptor subunit zeta 
1 precursor and FRATI. Some of the proteins that show no differential expression have previously been 
examined in gene-expression studies with similar conclusions. 

Below are reported results of a study to identify proteins in the human endometrium, and especially 
35 proteins that are differentially expressed in the proliferative and secretory phases. Differences in protein 
expression levels are highlighted and determined by the use of the cleavable ICAT reagent [8]. Proteins that are 
identified are compared with genes that have been examined for regulatory changes. 

The following materials and methods were used in the study. 
Materials and Methods 
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Tissue samples 

Endometrial tissues were retrieved from an in-house dedicated endometrial tissue bank. AH tissues had 
been snap-frozen in liquid nitrogen within 15-20 minutes of devitalization at the time of hysterectomy. All tissue 
procurements were carried out after informed patient consents. The consent, procurement and banking 
5 procedures were approved by the Research Ethics Boards of York University, Mount Sinai Hospital, University 
Health Network, and North York General Hospital. In every case, the endometrium was classified as 
proliferative or secretory by a pathologist. The histological classification was verified by examination of a 
histopathologic section from the frozen tissue. The mirror-face of the residual block was used for proteomic 
analysis. After addition of 1 ml of Hanks' Balanced' Sah Solution containing protease inhibitors (1 mM 4-(2- 

10 aminoethyl)benzenesuifonyl fluoride, IQ \iM leupeptin, 1 \xg/xicd aprotininand 1 fiM pepstatin), the tissue sample 
was mechanically homogenized at 30,000 rpm using a Polytron PT 1300D handheld homogenizer (Brinkmann, 
Westbury, USA). The whole tissue homogenate contained not only endometrial epithelium, but supportive 
stroma, vessels as well as secretions. The homogenate was stored in aliquot at -80°C and/or submitted for 
proteomic analysis. Samples from six different individuals were selected for this study. Three of these tissues 

1 5 had been classified as proliferative endometria (randomly designated as PRO 1 , PR02 and PR03), and the otiier 
three as secretory endometria (randomly designated as SECl, SEC2, SEC3). 

Figure 17 shows an example of the histologic appearances of (a) a proliferative (PR02) and (b) a 
secretory (SEC2) endometrium. In both endometria, the stratum basalis is characterized by a denser stroma than 
the physiologic responsive stratum functionalis above. Across the top of the stratum functionalis is the surface 

20 epithelium, which lines the endometrial cavity. The proliferative endometrium shows small, coiled glands with 
lining columnar epithelium reaching to the surface. By contrast, the secretory endometrium is thicker, and 
contains more tortuous glands with intra-luminal secretions. The endometria of both types of physiologic phases 
have abundant supportive stroma and vessels among the epithelial glands. 
Chemicals 

25 Solvents, chemicals (except otherwise noted), and Hanks' Balanced Salt Solution were obtained from 

Sigma- Aldrich (Oakville, ON, Canada). All reagents and buffers for the cleavable ICAT sample preparation 
procedure were from Applied Bios3^tems (Foster City, CA, USA). 
Sample Preparation 

After removal of cell debris by centrifugation, the total protein content for each of the six clarified 
3 0 homogenates was measured using a commercially available Bradford protein assay (Bio-Rad, Mississauga, ON, 
Canada). ICAT sample preparation procedure was carried out according to the cleavable ICAT protocol 
(Applied Biosystems, Foster City, CA, USA). One hundred micrograms of proteins was used per sample. The 
proteins were denatured with the Denaturing Buffer supplied with the ICAT reagent kit. Afterwards, disulfide 
bonds were cleaved by adding the reducing reagent supplied, which contained 50 mM tris- 
35 (carboxyethyl)phosphine, and boiling for 10 min. The proliferative samples were then labeled with the light 
ICAT reagent and the secretory samples with the heavy reagent by incubating with the respective ICAT label for 
2 h at 37*^0. The labeled PRO 1 and SECl samples were combined to form ICAT sample A, PR02 and SEC2 to 
form ICAT sample B, and PR03 and SEC3 to form ICAT sample C. The final volume of each sample was 0.2 
ml. Mixing of the labeled proliferative and secretory samples in pairs in this manner ensures that any protein or 
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peptide losses during subsequent processing steps is the same for both samples in a pair. The sample pairs were 
then digested by incubating each pair with 0.2 ml of a 50 [ig/ml trypsin solution at 37°C overnight. Tlie resulting 
peptides were fractionated by means of strong cation exchange chromatography using an HP 1050 LC system 
(Agilent, Palo Alto, CA, USA) with a 1 .5 ml injection loop and a 2. 1 mm internal diameter (ID) x 100 mm length 
5 PolyLC Polysulfoethyl A column packed with 5 \im beads with 300 A pores (The Nest Group, Southborough, 
MA, USA). A 2. 1 mm ID x 10 mm lengdi guard column of the same material was plumbed upstream from the 
analytical column. Fractionation was effected by a binary mobile-phase gradient at a total flow rate of 0.2 
ml/min. Eluent A consisted of a 10 mM KH2PO4 solution in 25% acetonitrile and 75% deionized water acidified 
to a pH of 3.0 with phosphorus acid. Eluent B consisted of a 1 0 mM KH2PO4 and 350 mM KCl solution in 25% 

10 acetonitrile and 75% deionized water acidified to a pH of 3.0 with phosphorus acid. The trypsinized ICAT 
sample pair (0.4 ml) was mixed with 2.0 ml of Eluent A. 1.5 ml of the 2.4 ml sample was injected. Initially, the 
gradient comprised 100% Eluent A. At the 2""^ minute, the % Eluent B was changed linearly from 0 to 100% at 
the 58*^ min. The run was terminated at the 60*^ min. A total of 30 fractions were collected, one every two 
minutes, using an SF-2120 Super Fraction Collector (Advantec MFS, Dublin, CA, USA). UV monitoring of the 

1 5 chromatographic eluent revealed abundant peptides in fractions 1 1 -20. These 10 fractions were ftirther purified 
by affinity chromatography according to the ICAT protocol recommended by Applied Biosystems. Eluted 
labeled peptides were treated with the Cleaving Reagent, which contains TFA in order to remove the biotin label, 
and resolved in a third stage of chromatographic separation using reverse-phase nanobore LC with online 
MS/MS. 

20 Nanobore LC/MS/MS 

The nanobore LC system was from LC Packings (Amsterdam, The Netherlands) and consisted of a 
Famos autosampler and an Ultimate Nano LC system. It was interfaced to an API QSTAR Pulsar QqTOF mass 
spectrometer (Applied Biosystems/MDS Sciex, Foster City, CA, USA) using a Protana NanoES ion source 
(Protana Engineering A/S, Odense, Denmark). The spray capillary was a PicoTip SilicaTip emitter with a 10 \im 

25 ID tip (New Objective, Woburn, MA, USA). The nanobore LC column was a 75 |im ID x 150 mm length 
reverse-phase PepMap CIS nano capillary column (LC Packings, Amsterdam, The Netherlands) packed with 3 
|Lim beads with 100 A pores. One \i\ of sample was injected via the full-loop mode. Separation was performed 
using a binary mobile-phase gradient at a total flow rate of 200 nl/min. Eluent A consisted of 94.9% deionized 
water, 5.0% acetonitrile and 0.1% formic acid (pH « 3). Eluent B consisted of 5.0% deionized water, 94.9% 

30 acetonitrile and 0. 1% formic acid. The following binary gradient was used: 

Time (min) 0 5 125 135 157 160 163 190 
% Eluent B 5 5 30 60 80 80 5 Stop 

For nanospray, the source conditions were a curtain-gas setting of 20 and an ionspray voltage in the 
range of 1800 - 3800 V that was optimized daily. In the QO region, the instrument parameters were a 
35 declustering potential (DP) of 65 V and a focusing potential (FP) of 265 V. Nitrogen was used as the collision 
gas at a setting of CAD = 5 for both TOF-MS and MS/MS scans. 
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All LC-MS/MS data were acquired in information-dependent acquisition (IDA) mod© in Analyst QS 
SP5 with Bioanalyst Extension 1.1 (Applied Biosystems / MDS Sciex). A TOF-MS survey scan with an m/z 
range of 400- 1500 and 1 s scan time was followed by two product ion scans with an m/z range of 70-2000 and 
2 s scan time. The collision energy (CE) was automatically controlled by the IDA CE Parameters script. The 
5 switching criteria were set to ions greater than m/z = 400 and smaller than m/z = 1500 with a charge state of 2 to 
5 and an abundance of > 10 counts/s. Former target ions were excluded for 60 s and ions within a 4 Th window 
were ignored. In addition, the IDA Extensions II script was set to 2 repetitions before dynamic exclusion and to 
select a precursor ion nearest to a threshold of 15 count/s every 4 cycles. These settings ensured examination of 
not only high abundance ions, but low abundance ones as well. Each of the three sample pairs. A, B and C, was 

10 injected twice, thus yielding a total of six sets of runs. 

Data analysis was performed using ProICAT 1.0 SP3 software (Applied Biosystems / MDS Sciex). An 
initial setting of a confidence value of 75 and a score of 15 was used. Relative quantification of proteins between 
the light and heavy labels was performed on the TOF-MS scans by calculating the relative areas under the series 
of peaks. Differential levels of expression were based upon measured protein expression ratios in proliferative 

1 5 versus secretory endometrium normalized to glyceraldehyde-3-phosphate dehydrogenase (GAPDH) expression 
and were considered significant when they exceeded three SDs &om the mean expression of housekeeping 
proteins (actin, tubulin a-chain and triose phosphate isomerase). All expression ratios were based on a mean of 
two essays. The sample size calculation for determination of minimum numbers of tissue samples for ICAT 
analysis (N) was based upon the difference between two means, N = 8 (SD / precision) [85]. Based on 

20 preliminary experiments, an SD value of 0.3 was observed with a target precision of 0.5, yielding a minimum 
sample size of 3 for each tissue group. The significance of differential protein expression between proliferative 
and secretory endometrium was evaluated using the Wilcoxon rank sum test. 
Results 

Figure 1 8 shows a typical nano LC/MS total ion chromatogram (TIC) fi-om firaction 16 of sample A. As 
25 a total of 10 fractions were analyzed per sample and each sample was injected twice, there were a total of 60 
TICs. The use of the IDA mode generated hundreds of MS/MS spectra per TIC. 

Using the initial setting of a confidence value of 75 (ProICAT recommends a lower confidence value of 
50) and a score of 15 in ProICAT 1.0 SP3 results in the identification of 346 non-redundant proteins. Manual 
inspection of the MS/MS spectra reveals that identification of some proteins that score between 15 and 20 may 
30 be problematic. A random selection of 50 spectra fi*om proteins scoring between 1 5 and 20 shows that 15 were 
analyzed and sequenced correctly, 13 were probably correct, and 22 were incorrect. Manual inspection of the 
MS/MS spectra of the 145 proteins that score 20 or higher also reveals occasional errors in assignment, but at a 
rate considerably lower than that between scores 15 and 20. Unfortunately, the use of ICAT-labelingresults in a 
reduced opportunity for corroborating identification based on a second tryptic peptide firom the same protein. 
35 Although the majority of human proteins do contain more than one cysteine residue per protein, some of these 
residues may be located within tryptic peptides that are too small or large, others may fall into peptides that have 
poor ionization efficiencies. Of the 346 proteins, only 23 were identified with more tlian one peptide; 15 of these 
23 proteins were identified with two peptides. 
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Table 6 lists the 119 proteins that have been identified by ProICAT and verified by manual inspection of 
MS/MS data. Figure 19 shows the distributions of the proteins in the form of a pie chart. Of the 1 19 proteins, 15 
were added after manual inspection of 50 randomly selected proteins' MS/MS spectra. If the MS/MS spectra of 
the remaining 151 proteins was inspected, it would have been expected to confirm the identifications of an 
5 additional ~ 45 proteins. This, however, was considered impractical, especially after it was known that none of 
the 1 5 additional proteins were observed in all three runs and would contribute towards knowledge of differential 
expression. 

The results show that expression levels of the majority of proteins identified are not consistently 
different between the proliferative and secretory phase of the endometrial cycle. As expected, the majority of 

1 0 identified proteins fall under the metabolic/housekeeping and structural categories. The proteins classified under 
"Others" include antibodies. A small percentage of the proteins are viral or pathogenic in origin, reflecting the 
possibility of infection in the biopsied endometria. A few proteins have no known functions or were identified 
from cDNA matches; these are, therefore, classified as hypothetical. 

To normalize small variations of protein amounts, relative quantification of the protein levels is 

15 normalized to the ratio of GAPDH. Gene expression studies involving different stages in the endometrial cycle 
have used GAPDH expression to normalize differential mRNA expressions [88-90]. The assumption is that the 
constant mRNA level of GAPDH through the endometrial cycle translate to constant protein level (no variation 
caused by translational controls). The GAPDH ratio of 1.5 dh 0.5 (one SD) before self-nomialization in the six 
runs is in accordance with this assumption. 

20 The ratios of the proteins in the secretory / proliferative phase (heavy / light label) listed in Table 6 are 

the averages ± SDs of the peptides in all the identified runs. In the case of more abimdant proteins, e.g. serum 
albumin and serotransferrin, individual peptides were detected in as many as 3-4 fractions. Less abundant 
proteins, however, were firequently detected only in 1-2 fractions (see Figure 20). In cases of multiple fractions, 
multiple peptides, and/or multiple runs, the ratios listed in Table 6 are the averages and the SDs of all 

25 contributing elements. Twenty four proteins show "extreme" differential expression in that only one labeled 
form was observable (these are noted as "Singleton H or L". However, only five of these 24 proteins were 
observed in all three samples (and an additional one in two samples). All five proteins were identified by single 
tryptic peptides. The expression levels of three housekeeping proteins (actin, tubulin a-chain and triose 
phosphate isomerase) normalized against GADPH were pooled to establish a mean expression level of 1 .07 1 . 1 

30 and an SD of 0.3. The SD of these expression levels was calculated firom expression data normalized to a value 
of 1 for any expression ratios less than 1 to generate linear data. (This was done by taking the reciprocal of 
expression values less than 1 in the original data set; for example, an expression ratio of 0.7 was converted to its 
reciprocal 1.42.) This data was then used to calculate the SD of housekeeping protein expression. A significant 
differential expression value was, therefore, defined as a normalized expression ratio that was three SDs above 

35 the ratio of 1.0. On the basis of the observed data, this ratio was set as 2.0 (approximately 1.0 + 3(0.3)). 
Consequently, only two- fold or larger changes in expression ratios were considered to be of significance, based 
on the analysis of the variation in measured expression of housekeeping proteins. An additional stringency 
requirement of this level of expression in all three sample pairs was imposed due to the small initial sample set. 
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No additional proteins in Table 6 meet the differential expression criteria of abundance ratios larger than 2.0 or 
smaller than 0.5 in all three sample pairs. 

An example of extreme differential expression observable in ail tliree sample pairs is shown in Figure 
21, which shows very significant expression enhancement of glutamate NMDA receptor subunit zeta 1 precursor 
5 in the secretory endometrium. The absence of the light-labeled analogue, which would have manifested as a 
cluster of peaks beginning at 578.6 Th, demonstrates the significantly lower expression of this protein in the 
proliferative endometrium. The triply charged tryptic peptide at 581.6 Th was identified as 
LLTLALLFSCS VAR [SEQ ID N0.28] , which maps to the N-terminal region of the protein. A second example 
is shown in Figure 22, again showing extreme differential expression in the secretory phase. This protein was 
10 identified as FRATl. For all five proteins that met the differential expression criteria stipulated above, a 
minimum p value of less than 0.04 was obtained on the basis of the Wilcoxon rank sum test, suggesting that the 
differences observed were of statistical significance. 
Discussion 

The observation of similar expression levels in the majority of proteins shown in Table 6 is not 

15 surprising, as many proteins are structural and housekeeping in nature. In addition, structural and housekeeping 
proteins, e.g. tubulin and actin, are expressed in many tissue types. This is of significance as the biopsies that 
were examined comprise not only epithelial cells, but also supportive stroma cells, blood and vessels that are not 
expected to be affected by the endometrial cycle. 

In tliis initial examination of three sample pairs, a conservative strategy was adopted of recognizing 

20 proteins as differentially expressed only when their expression ratios are larger than 2. 0 or smaller than 0. 5 in all 
three sample pairs. Of the 119 proteins listed in Table 6, five proteins are in this category. In fact, differential 
expression is in the extreme form that only single labels are apparent. There is one protein (POL polyproteiti) 
that was seen only in the secretory endometrium, but was observed in only two sample pairs and does not meet 
the criteria of differential expression in this study (see later). 

25 One of the five differentially expressed proteins is glutamate NMDA receptor subunit zeta 1 precursor, 

which was only observed in the secretory endometrium. This protein is a subunit of a ligand gated ion channel 
and is known to be involved with synaptic plasticity in neurons. A recent paper suggests that it may also play a 
role in glutamate-mediated toxicity to mitochondria, eventually leading to apotosis [91]. As stated earlier, the 
peptide maps to the N-terminal region of the precursor, which is normally cleaved to form the mature protein. 

30 Similarly, FRATl is detected only in the secretory endometrial samples under the experimental 

conditions. There is no direct connection between FRATl and the endometrium reported in the literature; 
however, FRATl is expressed in a wide range of human tissues [92]. The gene expressing FRATl is a known 
proto-oncogene that activates the WNT pathway [93,94]. This pathway is important in the endometrial cycle; 
FRATl is loiown to inhibit c-Jun activity, thereby inhibiting subsequent apotosis [95]. c-Jun, in turn, has been 

35 shown to be expressed in the endometrium and this expression is at a higher level during the proliferative phase 
than the secretory phase [96]. 

Myosin light chain kinase 2 is another protein that was observed exclusively in the secretory 
endometrium in all three sample pairs. While there are no studies on the relative amounts of this enzyme during 
the endometrial cycle, previous studies showed that it is present in the human myometrium at a level that is four- 
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to five-fold higher than that in endometrial stromal cells [97]. It was also demonstrated that the specific activity 
of myosin light chain kinase, which is believed to be essential for smooth muscle contraction, was 20-fold higher 
in the myometriuim than that in non-muscle cells, e.g. skin fibroblasts [97]. 

Isopentenyl diphosphate delta-isomerase, was present only in the secretory endometrium. This protein 
5 was identified via the peptide ITMLCTGSRTLK [SEQ ID NO. 29]. In addition to this identification with 
ProICAT, a subsequent search with Mascot (Matrix Science) verified this assignment This peptide, and 
therefore the protein, however, is unique and found only in the bacterium Rickettsia conorii. BLASTing the 
human and the R. conorii isopentenyl disphosphate delta-isomerase proteins showed no region of homology. In 
addition, MS BLASTing the peptide ITMLCTGSRTLK against non-redundant protein databases returned no 
10 other matches. 

A protein that appeared in all three sample pairs was initially identified by ProICAT as a simian 
immunodeficient viral protein, envelope polyprotein GP160 precursor. However, searching Mascot with the 
same MS/MS spectra returned a different protein from Clostridium, putative phosphatidylserine decarboxylase 
proenzyme (gi 28209853). Endometrial receptors for a Clostridium toxin have been documented [98]; thus the 
15 identification of a Clostridium protein is not entirely surprising. 

Another protein initially identified by ProICAT as POL polyprotein from Rous sarcoma virus was 
observed only in the secretory endometrium in two of the three sample pairs. A subsequent Mascot search, 
however, retumed a positive hit for an unlabeled peptide from serum albumin, and strongly suggested a wrong 
initial identification. 

20 Tropomyosin 1 alpha chain was identified in two out of three sample pairs to be selectively expressed in 

the secretory phase at a ratio of 2.3 ± 0.7. An enhanced expression of this protein in the secretory endometrium 
is in accordance with the result of a previous gene expression study that reported an up-regulation of 3.7 times 
[85]. 

A protein that showed no difference in expression levels (1.2 ± 0.7 in three samples) is macrophage 
25 migration inhibitory factor, MIF. This observation is consistent with those in previous studies that demonstrated 
MIF as being expressed by the human endometrium throughout the menstrual cycle and, in particular, 
predominantly in the glandular epithelial cells [9] . MIF was found to localize throughout the glandular epithelial 
cytoplasm in the proliferative phase; however, this distribution changes in the secretory phase, when MIF 
localizes to the apical portion of the glandular epithelial cells and is also present in glandular secretions. 
30 Macrophages are common in female reproductive tissues. In the endometrium, they play an important role in 
defense. In addition, macrophage degradation of cellular debris and foreign material may play an important role 
in endometrial shedding and repair [9]. 

Another protein observed in this study and for which there was precedence in the literature is 
glycodelin. On the basis of gene expression studies [85], glycodelin was believed to be up-regulated during the 
35 secretory phase. In this study, the precursor of this protein was identified in only one sample pair, but it was 
identified in both runs in only the secretory endometrium (Figure 23). 

Cathepsin B which, according to a previous study, does not appear to be differentially regulated during 
the menstrual cycle [99], was seen in two of the sample pairs with opposite trends (Figure 24). Glutathione S 
transferase is known to show large variability which is believed to result from individual differences, rather than 
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endometrial cycling [100]. The relative expression level observed in this study is 1.3 ± 0,4. Lactate 
dehydrogenase is another protein not believed to have any cyclic trend [101]; the relative level measured in this 
study, 1.4 d: 0.3, is in accordance v/ith this expectation. Peptidyl-propyl cis-trans isomerase A or cyclophilin A 
has been observed in placental and decidual tissues, but there was no study on relative amounts between the 
5 phases of the endometrial cycle [102]. The relative expression results of 1.2 ± 0.2 show a constant level of 
expression. Haptoglobin is thought to be up-regulated during the secretory phase [ 1 03] . The average relative 
expression level of 2.2 ± 1.6 is based on two enhanced secretory expression level measurements in two sample 
pairs and one reduced secretory expression level measurement in another sample pair that account for the very 
large SD. Haptoglobin is a hver protein secreted into blood. As tlie amounts of blood in the biopsied tissues 

10 could not be controlled prior to homogenizing, the large SD might simply reflect that the samples contain 
different amounts of blood as opposed to truly representing increased or decreased haptoglobin expression. 

It should perhaps be emphasized that protein identifications using ICAT-labeled peptides are reliable as 
the identification of one labeled peptide in a pair corroborates the identification of the other in the pair. This 
offsets somewhat the reduced opportunities in seeing multiple peptides because ICAT targets only cysteine- 

15 containing peptides. The chance of a missed identification increases when only one member of the labeled 
peptide pair is observable in tlie form of an extreme differential expression and there is uncertainty as to whether 
the observable peptide is labeled with the light or the heavy ICAT reagent In addition, the presence of an 
unlabeled peptide may be mistakenly interpreted as a labeled peptide that is differentially expressed in the 
extreme. An unlabeled peptide will need to have a combination of "right" chemical properties and very high 

20 initial concentrations (e.g. a serum albumin peptide) to "survive" tlie affinity chromatography stage intended to 
select only labeled peptides. The above is intended to be a cautionary note in pointing out potential shortcomings 
in a powerful technique that readily highlights and quantifies differential protein expressions, and not as a 
negative opinion on the suitability and utility of ICAT in protein discovery or quantification. 

Of the 46 studies published to date that centered on ICAT, the vast majority were on methods 

25 development that compared differential protein expressions in two different cell states. Most of these studies 
were on yeast cultured under different medium conditions and they compared protein levels of one sample pair. 
Only one recent study [104] targeted clinical samples, and even in this study only one sample pair was analyzed 
for differential protein expression. The present study is the only work in which multiple sample pairs have been 
examined. It is, of course, possible that a given observed deviation in protein ratio fi*om 1, i.e. a heavy/light label 

30 ratio 7^ 1, is due to individual differences, and that different pairing of the proliferative and secretory samples 
may produce a different ratio. It is because of this possibility that the criterion for differential expression was 
stringently set to a ratio of larger than 2. 0 and smaller than 0.5 m all three sample pairs. It turns out that for the 
five proteins that met the above criterion of differential expression, the differences were in the extreme in that the 
proteins were observed in only one endometrial phase in a pair. Expression differences at this extreme level in 

35 all three sample pairs are highly unlikely to have originated from individual differences. 
Conclusion 

From an initial list of 346 proteins identified by ProICAT 1.0 SP3, 119 proteins were identified after 
manual inspection of MS/MS data and additional searches using Mascot. The expression levels of the majority 
of proteins do not vary significantly between the proliferative and tlie secretory phases. Only five proteins 
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satisfy the criteria on differential expression in having an expression ratio larger than 2,0 or smaller than 0.5 in 
all three sample pairs. In fact, the differential expressions observed are extreme in that the proteins are present in 
only one of the two endometrial pliases. The two proteins with unquestionable differential expressions in the 
secretory endometrium are: glutamate NMDA receptor subunit zeta 1 precursor and FRATl. Some of the 
5 proteins that show no differential expression have previously been examined in gene-expression studies with 
similar conclusions. 
Example 6 
Abstract 

SELDI MS has conventionally been practiced on linear TOP which has low mass accuracy and 

10 resolution. Here in an examination of both malignant and nonmalignant endometrial tissue homogenates it is 
demonstrated tliat high mass accuracy and resolution in the MS stage are crucial. Using a commercially 
available QqTOF, two potential cancer markers were resolved and subsequently identified offline as chaperonin 
10 and calgranulin A, that differ by 8 Da in mass. Two offline protein identification protocols were developed: 
the first was based on SEC, SDS PAGE, protein extraction, trypsin digestion and MALDI-MS/MS; the second on 

15 SEC and shotgun nanoLC/MS/MS. Analyses on a cohort of 44 endometrial homogenates showed 22 out of 23 
nonmalignant samples had nondetectable to very low abundance of chaperonin 10 and calgranulin A; 17 of the 
21 malignant samples had detectable to abundant levels of both proteins. Immunohistochemical staining of a 
tissue microarray of 32 samples showed that approximately half of malignant endometrial tissues exhibited 
positive staining for calgranulin A in the malignant cpithehum, while 9 out of 10 benign tissues exhibited 

20 negative epithelial staining. In addition, macrophages/granulocytes in malignant as well as nonmalignant tissues 
showed positive staining. No immunostaining occurred in stroma or myometrium. 

The following materials and methods were used in the study. 
Materials and methods 
Tissue samples and sample preparation 

25 Endometrium and EmCa tissues were retrieved from a dedicated, research in-house endometrial tissue 

bank. The consenting and tissue banking procedures for this tissue bank were approved by the Research Ethics 
Boards of York University, Mount Sinai Hospital, University Health Network, and North York General Hospital. 
All tissues had been snap frozen in liquid nitrogen within 15-20 minutes of devitalization at the time of 
hysterectomy, and were obtained with patients' consent. In each case, the endometrium was classified as 

30 nonmalignant or malignant (EmCa) by one of the authors (TJC) [Kurman, R.J., Ed., Blaustein»s Pathology of the 
Female Genital Tract, 5^ ed. New York, Springer- Verlag, 2002], Nonmalignant endometrium included both 
benign physiologic endometrial cases (atrophic, proliferative, secretory, and menstrual) and pathologic states 
(benign endometrial polyp and disordered proliferative). EmCa cases included adenocarcinomas and rarer 
malignant mixed Mullerian tumors (carcinosarcomas). This classification was performed using routine surgical 

35 pathology sections. The histologic classification of the research tissue was verified by examination of a 
histopathologic section JSrom the frozen research tissue. Tissue was taken for proteomic analysis from the mirror- 
face of the residual block. Cases of atypical endometrial hyperplasia, which could be considered to represent an 
intermediate phenotype, were not available for frozen-tissue banking and MS analysis because all tissue was 
required for histopathologic examination, but tissue was available for tissue microarray (TMA) preparation. 
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I 

Tissue was thawed in Hanks' balanced salt solution (HBSS, Sigma) containing protease inhibitors (1 
mM 4-(2-aininoethyl)benzenesulfonyl fluoride, 10 [jM leupeptide, 1 jag/ml aprotinin and 1 pepstatin) and 
followed by mechanical homogenation at 30,000 rpm using a Polytron PT 1300D handheld homogenizer 
(Brinkmann, Westbury, USA). These whole tissue homogenates contained not only endometrial epithelium, but 
5 supportive stroma, vessels, as well as secretions. The homogenate was then stored in aliquot at -80 °C until 
protein profiling. 
Chemicals 

Solvents, chemicals (except otherwise noted), and Hanks' Balanced Salt Solution were obtained from 
Sigma-Aldrich (Oakville, ON, Canada). 

10 Protein profiling using SELD-MS 

Tissue lysate was fractionated to reduce sample complexity before protein profiling. An identical 
quantity of proteins was used for all samples; protein amounts were measured using Bradford Assay (Bio-Rad) 
on a DU-65 spectrometer (Beckman) at 595 nm. HBSS or urea/thiourea 2-D lysis buffer (7 M urea, 2 M 
thiourea) was used to compensate the initial volume to ensure equal volumes for all samples. Two micrograms 

15 of proteins from endometrial tissue homogenate was incubated with WCX2 or CMl 0 ProteinChips (Ciphergen) 
according to the manufacturer's instructions. In brief, samples were diluted to 55 \i[ with the corresponding 
binding buffer, spotted onto the appropriate ProteinChip, and incubated in a sealed BioProcessor (Ciphergen) for 
1 h at room temperature. The ProteinChip surface was washed twice with the appropriate buffer for 5 minutes, 
briefly rinsed with water and air-dried. Two times 0.5 jil of 50% saturated sinapinic acid in 50% acetonitrile was 

20 applied on the samples to form crystals. The ProteinChips were analyzed using a linear TOP analyzer, PBS lie 
(Protein Biology System lie, Ciphergen), or a hybrid QqTOF mass spectrometer, QSTAR XL (Applied 
Biosystems / MDS Sciex) fitted with the SELDI interface (Ciphergen). For accurate molecular-weight 
measurements, on-chip, internal mass calibration was performed using insulin (5,733 Da), cytochrome c (12,36 1 
Da) and myoglobin (16,95 1 Da) as calibrants. 

25 Protein purification and identification 

Selected EmCa samples were subject to chromatographic separation to yield purified proteins for 
identification. Five hundred micrograms of proteins from a whole tissue homogenate was first fractionated using 
SEC (BioSep 2000, Phenomenex) at 1 ml/min flow with phosphate buffer (pH 7.9) containing 0.05% (w/v) 
sodium azide. One-millilitre fractions were collected; the eluates were then concentrated to 50 \xl with a silicon 

30 carbide-based spin column (ProteoSpin, MDS Sciex). Five microlitres of a concentrate was desalted by C 1 8 Zip- 
Tip and analyzed with MALDI-MS to locate the fractions containing the protein markers of interest in tlie EmCa 
sample. Two different approaches were developed and used to identify the target proteins. Some proteins were 
more amenable to identification with one and others with the second approach. The first approach used SDS 
PAGE technique to further purify the target proteins. The fractions with the enriched proteins were diluted to 100 

35 p,l with freshly prepared DTT (5 mM final) in 150 mM Tris pH 8.5 buffer, and incubated at 60 °C for 1 h. Ten 
microlitres of the reaction mixture was desalted by CIS Zip-Tip and analyzed with MALDI-MS to assess the 
effect of DTT on the protein of interest. The remaining 90 |j.l was precipitated by acetone (80% (v/v) final), 
resuspended in SDS sample buffer, and the proteins were resolved by SDS PAGE. Protein molecular- weight 
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markers (New England Biolabs) and cytochrome C (C-2506 Sigma) were included to guide the excision of gel 
portions containing the protein of interest. Intact proteins were extracted from the gel by 50 (al extraction 
solution (formic acid/acetonitrile/isopropanol/water in a ratio of 50/25/15/10) at room temperature for 4 h. The 
extracts were completely dried by SpeedVac and resuspended in 40 \il 100 mM ammonium bicarbonate. Half of 
5 the resuspended proteins was desalted by CI 8 Zip-Tip and analyzed with MALDI-MS, the other half was 
digested in solution with 100 ng trypsin (Promega). The identity of the target protein was determined using 
MS/MS ion search (ProID, Applied Biosystems). The second approach used nanoLC/MS/MS technology. After 
concentration using ProteoSpin, the fractions with the enriched target proteins were ftirther cleaned up using CIS 
Zip-Tips. Four CIS Zip-Tips were used for each 45 p.1 of concentrated fractions. The sorbed proteins were eluted 
10 by 100 |al of 60% acetonitrUe with 0.3% TFA. The eluates were completely dried by SpeedVac and resuspended 
in 100 jil of 100 mM ammonium bicarbonate. The proteins were then digested in solution with 100 ng trypsin 
(Promega). Tlie resulted tryptic peptides were analyzed by MALDI-MS and online nanoLC/MS/MS. The 
identity of the target protein was determined using MS/MS ion search (ProID, AppHed Biosystems). 
NanoLC/MS/MS 

15 The nanoLC syistem was from LC Packings (Amsterdam, The Netherlands) and consisted of a Famos 

autosampler and an Uhimate nanoLC system. It was interfaced to a second QSTAR Pulsar hybrid QqTOF fnass 
spectrometer using a Protana NanoES ion source (Protana Engineering A/S, Odense, Denmark). The spray 

I 

capillary was a PicoTip SilicaTip emitter with a 10 \im ID tip (New Objective, Woburn, MA, USA). The 
nanoLC column was a 75 jim ID x 150 mm length reverse-phase PepMap CIS nano capillary column (LC 
20 Packings, Amsterdam, The Netherlands) packed with 3 pim beads with 1 00 A pores. One microliter of desalted 
sample was injected via tlie full-loop mode. Separation was performed using a binary mobile-phase gradient at a 
total flow rate of 100 nl/min. Eluent A consisted of 94.9% deionized water, 5.0% acetonitrile and 0. 1% formic 
acid (pH « 3). Eluent B consisted of 5.0% deionized water, 94.9% acetonitrile and 0.1% formic acid. The 
following binary gradient was used: 

25 

Time(min) 0 10 40 42 44 45 70 72 
% Eluent B 5.0 5.0 30.0 80.0 80.0 5.0 5.0 Stop 

The ion source conditions were a curtain-gas setting of 20 and an ionspray voltage in the range of 1 800 - 
3 800 V that was optimized daily. In the QO region, the instrument parameters were a declustering potential (DP) 
of 65 V and a focusing potential (FP) of 265 V. Nitrogen was used as the collision gas at a setting of CAD = 5 

30 for both TOF-MS and MS/MS scans. 

All LC-MS/MS data were acquired in information-dependent acquisition (IDA) mode in Analyst QS 
SP5 with Bioanalyst Extension 1.1 (Applied Biosystems / MDS Sciex). A TOF-MS survey scan with an m/z 
range of 400 - 1500 and 1 s scan time was followed by two product ion scans with an m/z range of 70 ~ 2000 and 
2 s scan time. The collision energy (CE) was automatically controlled by the IDA CE Parameters script. The 

35 switching criteria were set to ions greater than m/z = 400 and smaller than m/z = 1500 with a charge state of 2 to 
5 and an abundance of > 10 counts/s. Former target ions were excluded for 60 s and ions within a 4 Th window 
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were ignored. In addition, the IDA Extensions II script was set to 2 repetitions before dynamic exclusion and to 
select a precursor ion nearest to a threshold of 1 5 count/s every 4 cycles. These settings ensured examination of 
not only high abundance ions, but low abundance ones as well. 
Tissue microarray and immunohistochemical protein localization 
5 A 32-core TMA was prepared to confirm the presence of an identified protein marker (10,835 Th, see 

later sections), and to determine its localization. Following fixation in 10% buffered formalin, benign, 
hyperplastic and malignant endometrial tissues were histologically processed and embedded in paraffin blocks. 
A single 2-mm core of tissue was removed from the paraffin block using a Manual Tissue Arrayer (Beecher 
Instruments). The 26 endometrial study cases included 11 benign endometrium (five secretory, four 

10 proliferative, one atrophic and one menstrual), 13 malignant endometrium (12 adenocarcinomas and one 
carcinosarcoma), and two cases of atypical endometrial hyperplasia. Six other control tissues were also 
embedded into the TMA, including four non-endometrial carcinomas and 2 benign tumors (salivary gland 
pleomorphic adenoma and ovarian thecoma). Following MS identification, monoclonal mouse antibodies 
against the potential cancer marker were procured fi-om two sources: Research Diagnostics, Inc. (Flanders, NJ), 

15 labeled as RDI, and US Biological (Swampscott, MA), labeled as USB. Sections were cut from the TMA. The 
antibodies were applied in an appropriate dilution determined through a pilot study and immunohistochemically 
visualized using a diaminobenzidine chromogen. 

Interpretations of the two immunohistochemically stained TMA sections were conducted using a 
standardized method; microscopic reviews were conducted blinded to the findings of the other TMA section. 

20 Positive staining (brown) for the potential marker was determined for three cellular components: 
macrophage/granuloc3^e, epithelium/carcinoma, and endometrial stroma and myometrium. Immunostaining of 
these three cell types was semi-quantitatively graded as: 0 (no immunostaining), 1+ (weak positivity and/or 
positivity in only occasional cells), 2+ (moderate positivity and/or positivity in more abundant cells), and 3+ 
(strong positivity and/or positivity in numerous cells). 

25 Results and discussion 
SELDI-MS details 

The use of SELDI-MS profiling in discovering a protein marker, chaperonin 10, for EmCa is described 
above. Chaperonin 10 has a molecular weight of 10,843 Da. Performing SELDI-MS on the higher resolution 
QqTOF mass spectrometer (QSTAR XL) was revealing: chaperonin 10 firequently appeared as three instead of a 

3 0 single peak (Figure 25a). On the lower resolution PBS lie TOF mass spectrometer, which is much more widely 
available and used, the peaks are unresolved. Incubating the tissue homogenate with hydrogen peroxide 
overnight reduced the abundance of chaperonin 10 while increased the abundance of the protein that is heavier 
by 1 6 Da (compare Figure 25b with 25c). Increasing the fiuence of the MALDI laser (from 34.6 to 76.9 p,J) had 
the effect of increasing the abundance of the protein peak that is 1 8 Th lower than chaperonin 10 in m/z value. In 

35 addition, a new protein peal^ that is 36 Th lower in m/z value emerged (compare Figure 25d with 25e). 
Incubating with hydrogen peroxide enhanced oxidation of chaperonin 10, probably the lone methionine residue; 
as a result, the molecular weight of chaperonin 10 increased by 16 Da. Increasing the laser intensity probably 
promoted dehydration of chaperonin 10 within the MALDI source. The sources of the water were likely the five 
glutamic and eight aspartic acid residues. Laser-induced deamidation (-17 Th) could also occur, but was ruled 



wo 2005/061725 PCT/CA2004/002172 



-81- 



out after careful and repeated measurements mass-calibrated with cytochrome c. Single measurements for 
chaperonin 10 on the QSTAR XL with external or internal calibrants carry a typical uncertainty of ± 2 Th. 

As stated earlier, the aforementioned details were unresolved on tlie PBS lie TOP mass spectrometer, 
wliich displayed only one broad hump. The poor resolution in this case has no negative consequence as all peaks 
5 are different forms of chaperonin 10, However, this deficiency in resolution (and mass accuracy) may be 
detrimental when the sample contains a second protein of similar molecular weight. Having screened in excess 
of 40 endometrial homogenates (see later), it was found that a significant fraction of EmCa homogenates carry a 
protein that is lighter than chaperonin 10 by 8 Da. Figure 26 shows selected SELDI MS profiles, from four 
EmCa (designated by "C") and two normal endometria (designated by "N"), as collected on the PBS lie TOP and 
10 the QSTAR XL QqTOF mass spectrometers. The pH of the washing buffer was 6.0. The PBS instrument 
exhibited better sensitivity than the QSTAR, However, the profiles generated by the former contain broad 
humps, while those by the latter exhibit considerable details (Figure 26B). In addition, the mass accuracy of the 
PBS is considerably inferior to that of the QSTAR. Sample C15 is the simplest of the four EmCa samples and 
contains only chaperonin 10, its oxidized (10,859 Th) and dehydrated (10,825 Th) forms (compare Figure 25). 
15 Sample C8 has a little chaperonin 10 and contains mostly this second protein (10,835 Th), its oxidized (10,851 
Th) and dehydrated (10,817 Th) forms. Similarly, sample C6 has some chaperonin 10 admixed with the second 
protein. In sample CI 8, the relative abundance of chaperonin 10 and the unknown protein is approximately 1 to 
1 . The normal samples show traces of chaperonin 1 0. The possibility of having two proteins that differ by 8 Da 
in EmCa samples was confirmed by repeating some experiments, using pH 7.0 washing buffer. In the latter 

20 condition, the ProteinChip binds only the unknown protein (Figure 27). The abundances of chaperonin 10 and 
this second protein in nonmalignant and malignant endometrial tissue homogenates are summarized in Tables 7 
and 8, respectively. As SELDI/MALDI analyses on the same sample exhibit considerable variation from shot to 
shot and from spot to spot, the scales adopted, from "0" (absencie) to "5+" (S/N > 10 or higher) are intended to 
convey semiquantitative rather than quantitative information. For nonmalignant samples, chaperonin 1 0 and the 

25 second protein are absent or present in only very low abundance ("1+") in 22 out of 23 cases; the exception, 
however, is different for the two proteins. For the malignant samples, two out of 21 cases show absence of both 
chaperonin 10 and the second protein; in addition, two cases show absence of chaperonin 10 but liigh abundances 
of the second protein, while another two cases show high abundances of chaperonin 10 but absence of the second 
protein. The absence of the potential cancer markers in some malignant tissue homogenates may be due to 

30 nonoptimal MS conditions and/or absence of tumors in the tissue block homogenized and analyzed. 
Identification of the unknown protein 

The vast majority of SBLDI-MS applications use spectral signatures from sera to differentiate normal 
and diseased states without actual protein identification [22, 25, 30, 105-108]. This strategy has come under 
criticisms and the clinical effectiveness of serum proteomics is currently being questioned [109-119]. The 

35 identification of chaperonin 10 in EmCa tissue homogenates as a marker for endometrial cancer is described 
above. The identification strategy was based on SEC, SDS PAGE isolation, protein extraction, trypsin digestion 
and MALDI-MS/MS identification. However, the preferred strategy is shotgun nanoLC/MS/MS after initial 
purification using SEC. 
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The unknown protein has a molecular weight of 10,835 Da, Using the Tagldent tool available a;t the 
ExPASy Website [133] with the following entries: molecular weight, 10,835 ± 2 Da (i 0.02%); pi, 7 ± 5; and 
organism classification, human to search the Swiss-Prot database that contains 15 1,047 entries, returned only two 
proteins, calgranulin A (accession number, P05 109) and small nuclear ribonuciear protein D homolog (accession 
5 number, Q9Y333). A sample was used that contained the 10,835 Da protein in abundance for SEC. This 
protein was found to have the highest concentration in SEC fraction 7. NanoLC/MS/MS of the tryptic peptides 
contained in this fraction identified a number of proteins (Table 9). The only protein that has the correct 
molecular weight is calgranulin A, 10,834 Da. The other proteins all have higher molecular weights (see later). 
Figure 28 shows MS/MS spectra of three tryptic peptides, (A) MLTELEK [SEQ ID NO. 50], (B) 
10 ALNSIIDVYHK [SEQ ID NO. 51] and (C) GADVWFK [SEQ ID NO. 52], that unambiguously identify 
calgranulin A. The first two peptides constitute residues 1—18, the third peptide residues 50-56. The three 
peptides constitute a sequence coverage of 25 out of a total of 93 residues or 27%. 
Calgranulin 

Calgranulin A is a member of the SlOO family of calcium-binding proteins [134]. It is also known as 

15 S 1 00 calcium-binding protein A8, migration inhibitory factor-related protein 8 (MRP-8), cystic fibrosis antigen 
(CFAG), PS, leukocyte LI complex light chain, and calprotectin LIL subunit. It has been reported as a specific 
protein for several diseases [135-156], but has not been implicated in EmCa. Calgranulin A can form 
homodimers as well as higher-order oligomers [139]. Calgranulin A has been found to be associated with a 
second protein in the SlOO family, calgranulin B (13,242 Da) [137,139]. This heterodimer is thought to be 

20 involved in inflammation, including the transmigration process of leukocytes and transport of arachidonic acid to 
target cells [ 134, 137]. In the present study, calgranulin B was also identified along with calgranulin A, albeit by 
only one peptide. The possibility that calgranulin A and B coexist as members of a larger unit is in accordance 
with the observation that calgranulin eluted early in SEC in fraction 7 along with other proteins of much higher 
molecular weights (Table 9). However, calgranulin B does not appear to be retained on the ProteinChips under 

25 conditions that are favorable for calgranulin A. 

Tissue microarray and immnnhlstochemical localization of calgranulin A 

A summary of the TMA immunohistochemical studies using two different monoclonal antibodies (RDI 
and USB) is shown in Table 10. There is great similarity/identity between the findings from the two 
immunohistochemical studies. Calgranulin A was localized to the malignant epithelium of endometrial 

3 0 adenocarcinoma and carcinosarcoma. Approximately half of these malignant endometrial tissues showed 1+ to 
3+ immunostaining for calgranulin A (Figure 29A and B). The most striking positivity was noted in the 
squamous areas of three endometrioid carcinomas (Figure 30). Two of four non-endometrial carcinomas also 
exhibited immunostaining for calgranulin A in the malignant epithelium. Immunohistochemical staining for 
calgranulin A was absent in 9 of 10 benign endometrial cases. A single case of menstrual endometrium exhibited 

35 focal immunostaining (1+). Macrophages/granulocytes in benign endometrium (Figure 31), malignant 
endometrium (Figure 29), andnon-endometrial carcinomas showed discrete strong immunostaining of variable 

4. 

numbers of cells with immunostaining ranging from 1+ to 3+. No immunostaining for calgranulin A was noted in 
stroma or myometrium. 
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The MS results presented here show that elevated levels of calgranulin A were detectable in about two- 
thirds of EmCa homogenate cases, with 14 of 21 cases demonstrating 2+ or greater abiindance for this protein 
(Table 8), By contrast, only minor levels were observed in most nonmalignant endometrium, with 22 of 23 cases 
demonstrating 1+ or lower abundance (Table 7). 

5 The TMA findings not only confirm the presence of calgranulin A, at least as assessed by 

immunohistochemistry, but also indicate two cellular sources for the MS findings. In EmCa, the sources are both 
the malignant epithelium (carcinoma) and macrophage/granulocytes, whereas in norunallgnant endometrium, 
macrophages seem to be the predominat source, and only rarely benign (menstrual) endometrium (Table 10). 
The TMA analysis identified calgranulin A not only in a greater proportion of the epithelium in the EmCa cases, 

10 but also with greater intensity, than in nonmalignant endometrial cases. Both cases of atypical hyperplasia 
showed 1+ immunostaining of the affected endometrial glands. 

Employment of calgranulin A, in combination with other protein markers (including chaperonin 10), 
may provide a robust endometrial cancer diagnostic and/or screening panel. The utility of any panel will increase 
with the number of markers employed, and the sensitivity and specificity of individual markers. This study 

15 demonstrates the complementary nature of MS and TMA in identifying and localizing sources of potential cancer 
markers in this search, and a possible utility of TMA in selecting cancer markers. 
Conclusion 

High mass accuracy and high resolution in the MS stage are crucial in SELDI MS. Using the QSTAR 
XL, two potential cancer markers were resolved and subsequently identified offline as chaperonin 10 and 

20 calgranulin A, that differ by 8 Da in mass. These two proteins were unresolved on the PBS lie mass 
spectrometer. Two offline protein identification protocols were developed: the first was based on SEC, SDS 
PAGE, protein extraction, trypsin digestion and MALDI-MS/MS; the second on SEC and shotgun 
nanoLC/MS/MS. Analyses on a cohort of 44 endometrial homogenates showed 22 out of 23 nonmalignant 
samples had nondetectable to very low abundance of chaperonin 10 and calgranulin A; 17 of the 21 malignant 

25 samples had detectable to abundant levels of both proteins. Immunohistochemical staining of a tissue microarray 
of 32 samples showed that approximately half of malignant endometrial tissues exhibited positive staining for 
calgranulin A in the malignant epithelium, while 9 out of 10 benign tissues exhibited negative epithelial staining. 
In addition, macrophages/granulocytes in malignant as well as nonmalignant tissues showed positive staining. 
No immunostaining occurred in stroma or myometrium. Calgranulin A, in combination with chaperonin 10 and 

30 other proteins, can constitute a panel of markers to permit diagnosis and screening of endometrial cancer. 
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Table 1 

Differentially Expressed Proteins in Endometrial Malignancies/Cancer 



Protein 



Chaperonin 10 



Calgranulin A 



Calgranulin B 



Poiymeric-immunoglobulin 
Receptor [precursor] 

Phosphatidyl ethanolamine- 
binding protein (PEBP) 

Acidic leucine-rich nuclear 
phosphoprotein 32 family 
member A 

Heat shock 70 kDa protein 6 



Macrophage migration 
Inhibitory factor (MIF) 



40 Calgizzarin (SIOOC protein) 



Triosephosphate isomerase 



Gene name 



HSPEl 



S100A8 



S100A9 



PIGR 



PEP 



ANP32A 



HSPA6 



MIF 



SlOOAll 



TPIl 



Alpha- 1 -antitrypsin precursor SERPINA 1 



Creatine kinase, B chain (B-CK) CKB 



Swiss-Prot Acc, No. 



Q04984 and AAH23518 
[SEQ ID 1] 
NM_002157and 
U07550 [SEQ ID 2] 

P05109 [SEQ ID 3] 
A12027 [SEQ ID 4] 
NM_002964 
[SEQ ID 5] 

P06702 [SEQ ID 6] 
X06233 [SEQ ID 7] 
M21064 [SEQ ID 8] 

P01833 or Q81ZY7 [SEQ ID 9] 
NM_002644 [SEQ ID 10] 

P30086 [SEQ ID 1 1] 
NM_002567[SEQ ID 12] 

P39687 [SEQ ID 13] 
NM_006305 [SEQ ID 14] 



PI 7066 [SEQ ID 15] 
NM_002155 [SEQ ID 16] 
X51757 [SEQ ID 17] 

P14174 [SEQ ID 18] 
NM_002415 [SEQ ID 19] 
L19686 [SEQ ID 20] 

P31949 [SEQ ID 21] 
NM„005620 and D38583 
[SEQ ID 22] 

P00938andNP 000356 
[SEQ ID 23] 

NM„000365 [SEQ ID 24] 
X69723 [SEQ ID 25] 

gi/1703025 

ITHU and P01009 [SEQ ID 30] 
NM_000295[SEQ ID NO. 31] 
K02212 [SEQ ID 32] 

gi/125294 

P12277[SEQ ID N0.33] 
NM_001823 [SEQ ID NO 34] 
X 15334 [SEQ ID NO 35] 



Expression 
in EmCa 



Up 



Up 



Up 



Up 



Up 



Up 



Up 



Up 



Up 



Up 



Under 



Under 
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10 



15 



Pyruvate kinase, Ml or M2 isozyme PKM2 



Transgelin (smooth muscle 
protein 22-alpha) 



Heterologous nuclear 
ribonucleoprotein D 



TAGLN 



(hnRPD 



gi/20178296; gi/125604; 
KPYl^HUMAN (P 1461 8) 
[SEQ ID NO. 36] 
X56494 [SEQ ID 37] 

gi/3 123283 
Q01995 [SEQ ID.38] 
D84342 [SEQ ID 39] 

ROD__HUMAN (Q14103) 
[SEQ ID NO.40] 
AF026126 [SEQ ID 41] 



Up 



Under 



Up 



wo 2005/061725 



- 86- 



PCT/CA2004/002172 



Table 2 

The identification of Ciiaperonin 10 by mass spectrometry and 
Western blotting techniques in non-malignant endometrial tissue homogeneates. 



Case 


Histopathologic 


MS results 






Classification 


(relative intensity) 


(r 


1 


Proliferative 




0 


2 


Proliferative 




o 


3 


Secretory 


+ 


o 


4 


Atrophic 




+ 


5 


Benign polyp 


o 




6 


Atrophic 


+ 


+ 


7 


Secretory 


o 


0 


8 


Disordered proliferative 
endometrium 


0 




9 


Secretory 


0 




10 


Disordered proliferative 


o 


+ 


11 


Secretory 


o 




12 


Secretory 


+ 


+ 


13 


Secretory 


o 


+ 


14 


Secretory 


+ 


o 


15 


Atrophic 


0 


0 


16 


Secretory 


+ 


+ 


17 


Menstrual 






18 


Secretory 


+ 


+ 


19 


Proliferative 


+ 


o 


20 


ProHferative 


+ 


+ 


21 


Menstrual 


o 


0 


22 


Atrophic 


o 


0 


23 


Atrophic 


++ 


+ 
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Table 3 

The identification of Chaperonin 10 by mass spectrometry and 
Western blotting techniques in malignant endometrial tissue homogeneates. 



Case 


Histopathologic Classification 


MS results 


Western Blotting Results 






(relative intensity) 


(relative intensity) 


24 


Em AdCa 1/3 


++ 


44- 


25 


Em AdCa 1/3 


O 


-1- 


26 


MMMT 


11111 

n 1 1 1 r 


-H-H-+ 


27 


Em AdCa 




++ 


28 


MMMT* 


+ 


o 


29 


Em AdCa 2/3 


H- 


+-H- 


30 


Em AdCa 2/3 


+ 


■1 1 M 


31 


Ser AdCa 




++++4- 


32 


Em AdCa 1/3 


. > 1 1 1 1 
in 1 11 


++-H-+ 


33 


Em AdCa Grade n.k. 


0 


+-H-f 


34 


Em AdCa Grade n.k. 


+ 


H-H- 


35 


Em AdCa Grade n.k. 


o 


-f 


36 


Em AdCa 1/3 


+4-H-I- 


1)11 


37 


Em AdCa 1/3 


+-H-I- 


-H-f 


38 


Em AdCa 1/3 




-HH- 


39 


Muc AdCa 1/3 


H — )~ 


0 


40 


Em AdCa 1/3 


+-H- 


H-H- 


41 


Em AdCa 1/3 


-H-+ 


-H- 


42 


Em AdCa - Ser AdCa 


-H-H-+ 


H-H-I-+ 


43 


Em** AdCa 1/3 


0 


■ 

o 




Em AdCa 2/3 


1 1 1 1 





5 *Mirror section showed minimal tumor. 

**Mirror image showed necrotic tumor only. 
Em AdCa = Endometrioid adenocarcinoma 
MMMT = Malignant Mixed Mullerian Tumor 
Muc AdCa = Mucinous adenocarcinoma 
10 n.k. = not known 

Ser AdCa = Serous adenocarcinoma 
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Table 4 



Differentially expressed proteins quantiHed by ITRA^Q. 
S: P is the ratio of secretory phase relative to the proliferative phase 
CI: P and C2: P are the ratios of cancer samples 1 and 2 relative to the proliferative phase, 

respectively- 



accession_id 

gi|461730 



gi|1703025 

gi| 125294 



gi 



20178296 or 
125604 



gi|3 123283 



name 

10 kDa heat shock protein, 
mitochondrial (HsplO) (10 kDa 
chaperonin) (CPNIO) 

Alpha- 1-antitrypsin precursor 

Creatine kinase, B chain (B-CK) 

Pyruvate kinase. Ml or M2 
isozyme 

Transgelin (Smooth muscle 
protein 22-alpha) 



S: P (± SD) 

1.06 ±0.12 



1.25 ± 0.39 
0.96 ±0.04 
1.03 ±0.11 

1.46 ±0.26 



C1:P(±SD) C2:P(±SD) 

2.71 ±0.52 2,23 ±0.29 



0.34 ± 0.07 
0.38 ±0.16 
2.75 ± 0.03 

0.26 ± 0.04 



0.44 ±0.16 
0.52 ±0.06 
2.02 ± 0.08 

0.45 ± 0.05 



I 
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Table 5 



Differentially expressed proteins quantiHed using cICAT. Singleton H signifies extreme 
Overexpression in the cancer sample relative to the normal proliferative control. 
Standard deviations were calculated from all three sample pairs. 



Accession # 

S 1 1 INHUMAN (P3 1 949) 



Name 

Calgizzarin (SIOOC protein) (MLN 70). 



Avg. C: N ratio (± SD) 
Singleton 



ROD_HUMAN (Q 14 1 03) Heterogeneous nuclear ribonucleoprotein DO 



4.62 (=fc 0.39) 



MIF__HUMAN (P14174) 



PEBP_HUMAN (P30086) 



Macrophage migration inhibitory factor 

(MIF) 

Phosphatidylethanolamine-binding protein 

(PEBP) 



KPY1_HUMAN (P14618) Pyruvate kinase, Ml isozyme (EC 2.7. 1.40) 



2.74 (i 0.92) 



3.72 (± 1.03) 



4.61 (± 1.40) 



KPY_HUMAN: 
PIGR_HUMAN (P01833) 



Polymeric-immunoglobulin receptor 
precursor 



9.83 (+/- 6.61) 



a: Calculation of SD is not possible 
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Table 6 



Proteins Identified and Secretory / Proliferative Expression Levels 



Accession # 

143T_HUMAN 
(P27348) 

RS12_ORE^^[ 
(013019) 

RS21_HUMAN 
(P35265) 

HPPD_MYCGR 
(042764) 

RL36_VIBCH 
(P78001) 

ACTA_HUMAN 

(P03996) or 
ACTS_HUMAN 
(P02568) 



Description Average H:L 

ratio 

14-3-3 protein tau (14-3-3 protein theta) 0.81 



40S ribosomal protein S12. 



40S ribosomal protein S21. 



0.52 



NQ 



4-hydroxyphenylpyruvate dioxygenase Singleton L 
(EC 1.13.11.27) 



50S ribosomal protein L36. 

Actin, aortic smooth muscle or Actin 
alpha skeletal muscle 



0.84 



1.86 



SD Frequency 



ACTB_HUMAN 
(P02570) 

SAH1_XENLA 
(P51893) 

AFAM_HUMAN 
(P43652) 

ADHL^HUMAN 
(PI 1766) 

DHALHUMAN 
(P00352) 

A1AG_HUMAN 
(P02763) 

A2HS_HUMAN 
(P02765) 

ALU5_HUMAN 
(P39192) 

AOP2_HUMAN 
(P30041) 

TRAI_AGRT5 
(P33907) 

APOH_HUMAN 
(P02749) 

ARK1„HUMAN 
(P25098) 



Actin, cytoplasmic 



Adenosylhomocysteinase 1 (EC 3.3. 1.1) 

Afamin precursor (Alpha-albumin) 
(Alpha-Alb). 

Alcohol dehydrogenase class III L chain 

Aldehyde dehydrogenase lAl (EC 
1.2.1.3) 

Alpha- 1 -acid glycoprotein 1 precursor 

(AGPl) 

Alpha-2-HS-glycoprotein precursor 
(Fetuin-A) 

Alu subfamily 

Antioxidant protein 2 (1-Cys 
peroxiredoxin) 

Autoinducer synthesis protein tral. 



Beta-2- glycoprotein I precursor 
(Apolipoprotein H) 

Beta-adrenergic receptor kinase 1 (EC 
2.7.1.126) 



0.93 



0.70 



NQ 



0.67 



0.60 



1.1 



0.82 



1.1 



NQ 



0.55 



0.58 



0.04 



1 



0.5 



0.54 



Singleton L 
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Accession# Description Average H:L SD Frequency 

ratio 

BPEA_HUMAN Bullous pemphigoid antigen 1, isoforms 0.99 1 

(094833) 6/9/10 

CLP INHUMAN Calponin HI, smooth muscle (Basic 1,8 1 

(P5I911) calponin) 

KAP3_HUMAN cAMP-dependent protein kinase type II- Singleton L 1 

(P31323) beta 

CABA_MOUSE CARG-binding factor-A (CBF-A). 0.97 0.33 3 

(Q99020) 

CATB^HUMAN Cathepsin B precursor (EC 3.4.22.1) 0.88 0.70 2 

(P07858) ' (Cathepsin Bl) 

CB32_YEAST Centromere DNA-binding protein NQ 1 

(P40969) complex CBF3 subunit 

BPHB_BURCE Cis-2,3-dihydrobiphenyl-2,3-diol Singleton L 1 

(P47227) dehydrogenase (EC 

CLH^BOVIN Clathrin heavy chain, 0.62 1 

(P49951) 

COFl_HUMAN Cofilin, non-muscle isoform (18 kDa 0.95 0.15 3 

(P23528) phosphoprotein) 

C03_HUMAN Complement C3 precursor 1.1 0.1 2 

(P01024) 

CFAH_HUMAN Complement factor H precursor (H factor NQ 

(P08603) 1). 

AXO 1_CHICK Contactin 2 precursor (Axonin-1). 0.85 
(P28685) 

KCRB_HUMAN Creatine kinase, B chain (EC 2.7.3.2) (B- 6.0 

(P 12277) CK). 

CRBH HUMAN Crumbs protein homolog 1 precursor. NQ 
(P82279) 

PYRG_STAAM CTP synthase (EC 6.3,4.2) (UTP- Singleton H 

(Q99SD1) ammonia ligase) (C 

CYSR_HUMAN Cysteine-rich protein 1 (CRP 1) (CRP). 4.4 
(P21291) 

DDX4„HUMAN DEAD-box protein 4 (VASA homolog). Singleton H 
(Q9NQI0) 

DEST^HUMAN Destrm (Actin-depolymerizing factor) 2.4 

(P18282) (ADF). 

EF2_HUMAN Elongation factor 2 0.60 
(P13639) 

XYNA_PIRSP Endo-l,4-beta-xylanase A precursor (EC 0.32 

(Q 12667) 3.2.1.8) (1 



I 
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Accession # 

ENOA_HUMAN 

(P06733) or 
MPB1_HUMAN 
(P22712) 



Description 

Enolase(EC 4,2. 1.11) (2- 
phosphoglycerate dehydrat.) or c-myc 
promoter binding protein (MBP-1) 



Average H:L SD 
ratio 

2.6 1.9 



Frequency 



ENV_SIVM1 ^ 
(P05885) 

LEM2_CANFA 
(P33730) 

FIBG_HUMAN 
(P02679) 

FLNA^HUMAN 
(P21333) 

ALFA_HUMAN 
(P04075) 

G25P_HUMAN 

(P25763) or 
RAC1_HUMAN 

(P15154) or 
RHOJ_HUMAN 

(Q9H4E5) 



Envelope polyprotein GP160 precursor Singleton L 



E-selectin precursor (Endothelial 
leukocyte adhesi 

Fibrinogen gamma chain precursor 



Filamin A (Alpha-filamin) 
(Filamin 1) 

Fructose-bisphosphate aldolase A (EC 
4.1.2.13) (Mu 

G25K GTP binding protein (CDC 42 
homolog) or Ras related C3 botulinum 
toxin substrate 1 or 2 or 3 or Rho-r elated 
GTP-binding protein Rho G or RhoJ 



Singleton H 



1.2 



1 



1.6 



1-2 



NQ 



0.8 



0.2 



1 



LEGl CRIGR 
(P48538) 


Galectin- 1 (Beta-galactoside-binding 
lectin L-14-I) 


0.67 


0.52 


3 


GELS HUMAN 
(P06396) 


Gelsolin (Actin-depolymcrizing factor) 

(ADF) 


1.4 




2 


NMZl HUMAN 
(Q05586) 


Glutamate [NMD A] receptor subunit zeta Singleton H 

1 precursor 




3 


GTP_HUMAN 
(P09211) 


Glutathione S-transferase P 
(EC 2.5.1.18) 


1.3 


0.4 


3 


G3P2 HUMAN ^ 
(P04406) 


Glyceraldehyde 3-phosphate 
dehydrogenase 


1.0 


0.0 


3 


PAEP HUMAN 
(P09466) 


Glycodelin precursor (GD) (Pregnancy- 
associated en 


Singleton H 




1 


HPTl HUMAN 
(P00737) 


Haptoglobin- 1 precursor. 


2.2 


1.6 


2 


HS70 HUMAN 
(08107) 


Heat shock cognate 7 1 IcDa protein. 


1.3 


1.1 


3 


HS9A HUMAN 

(P07900) 


Heat Shock Protein HSP90-alpha 


NQ 




1 


HBB HUMAN 
(P02023) 


Hemoglobin beta chain. 


0.60 


0.45 


3 


HEMO HUMAN 
(P02790) 


Hemopexin precursor (Beta-IB- 
glycoprotein). 


1.1 


0.3 


3 
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Accession # 

ROD^HUMAN 
(Q14103) 

ROK_HUMAN 
(Q07244) 

HNT1_HUMAN 
(P49773) 

YF48_HUMAN 
(Q9HCM4) 

Y885_MYCTU 
(Q10546) 

GC1_HUMAN 
(P01857) 

GC2_HUMAN 

(P01859) or 
GC4__HUMAN 
(P01861) 



Description Average H:L SD 

ratio 

Heterogeneous nuclear ribonucleoprotein 1.0 0.2 
DO (hnRNP DO) 

Heterogeneous nuclear ribonucleoprotein 0.6S 
K (hnRNP K) 

Histidine triad nucleotide binding protein 1.4 

1 

Hypothetical protein KIAA1548 0.85 



Hypothetical protein Rv0885. NQ 

Ig gamma- 1 chain C region. 0-76 0.23 

Ig gaiTima-2 chain C region. Or Ig 1.0 0.5 
gamma-4 chain C region 



Frequency 
3 

1 



1 



GC3_HUMAN 
(P01860) 

KAC„HUMAN 
(P01834) 



Ig gamma-3 chain C region 



Ig kappa chain C region. 



HUMAN (POl 876) Ig alpha - 1 chain C region or Ig alpha-2 
or (PO 1 877) chain C region 



0.85 



0.88 



1.4 



0.22 



0.37 



1 



IDI2_RICCN ' 
(Q92HM7) 

TRFL_MOUSE 
(P08071) 

LGUL^HUMAN 
(Q04760) 

CSR3_HUMAN 
(P50461) 

LDH„HUMAN 
(P00338) or 
(P07195) 

MIF^HUMAN 
(P14174) 

MSRE^MOUSE 
(P30204) 

MDHM_HUMAN 
(P40926) 

MSP1_PLAF3 
(P 19598) 



Isopentenyl-diphosphate delta-isomerase Singleton H 

(EC 5.3.3. 

Lactotransferrin precursor (Lactoferrin). Singleton H 



Lactoylglutathione lyase (EC 4.4.1.5) 
(Methylglyox 

LIM domain protein, cardiac (Muscle 
LIM protein) ( 

L-lactate dehydrogenase (EC 1.1.1.27) 

(LDH). 



Macrophage migration inhibitory factor 

(MIF) 



1.0 



4.2 



1.4 



1.2 



Macrophage scavenger receptor types I Singleton H 

and II 



Malate dehydrogenase, mitochondrial 

precursor 



NQ 



Merozoite surface protein 1 precursor Singleton L 



0.2 



0.3 



0.7 



1 



1 



1 
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Accession # 

MU5A_HUMAN 
(P98088) 



Description 

Mucin 5 AC (Mucin 5 subtype AC, 
tracheobronchial) 



Average H:L 
ratio 
NQ 



SD 



MYH9_HUMAN Myosin heavy chain non muscle type A or 0.29 

(P35579) or B 
MYHA^HUMAN 
(P35580) 



KML2_RABIT 
(P07313) 

NIRJSTEUCR 
(P38681) 

NOG3_BRARE 
{Q9YHV3) 

NDKA_HUMAN 

(P15531) or 
NDKB_HUMAN 
(P22392) 



Myosin light chain kinase 2 

Nitrite reductase [NAD(P)H] 
(EC 1.7.1.4). 

Noggin 3 precursor. 
Nucleotide diphosphate kinase A or B 



Singleton H 



1.3 



1.3 



1.2 



CYPH_HUMAN Peptidyl-prolyl cis-trans isomerase A (EC 1.2 



(P05092) 



5.2.1.8) 



PEBP_BOVIN Phosphatidylethanolamine-binding protein 1.0 



(P 13696) 

PGK_HUMAN 
(P00559) 



(PEBP) (B 
Phosphoglycerate kinase (EC 2.7.2.3). 



PMG1_HUMAN Phosphoglycerate mutase 1 (EC 5.4.2. 1) 

(EC 5.4.2.4 



0.99 



0.73 



(P18669) 

POL^RSVP ^ 
(P03354) 

PIGR_HUMAN 
(P01833) 



POL polyprotein 



Polymeric- immunoglobulin receptor 
precursor (Poly- 



Singleton H 



1.1 



AR34_DROME Probable ARP2/3 complex 34 kDa subunit 
(Q9VIM5) (P34-ARC), . 



AMPA_CAMJE 
(Q9PP04) 



Probable cytosol aminopeptidase (EC 
3.4.11.1) (Leu 



3.5 



0.93 



0,2 



0.2 



Frequency 
1 



1 



1 



1 



PR01_HUMAN 
(P07737) 

PROP_HUMAN 
(P27918) 

PRTH„PORGI 
(P46071) 

PSA5_HUMAN 
(P28066) 



Profilin I 

Properdin precursor (Factor P). 

Protease prtH (EC 3.4.22.-). 

Proteasome subunit alpha type 5 (EC 
3.4.25.1) (Pro 



1.1 



Singleton H 



Singleton L 



2.1 



1 
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Accession # 

VN02_VACCC 
(P20641) 

FRT INHUMAN 
(Q92837) 

022C_DROME 
(P81911) 

KPY1_HUMAN 
(P 146 18) 

CUTl_SCHPO 
(PI 8296) 

TRFE_HUMAN 

(P02787) 

ALBU^HUMAN 
(P02768) 

STC_DROME 
(P40798) 

SLIl^HUMAN 
(Q13642) 

PCPl__SCHPO 
(Q92351) 

SODC_HUMAN 
(P00441) 

THIO_HUMAN 
(P10599) 

TPIS_HUMAN 
(P00938) 

TPM INHUMAN 
(P09493) 

TPM4_HUMAN 

(P07226) or 
TPM2_HUMAN 
(P07951) 



Description 

Protein N2. 

Proto-oncogene FRATl 
Putative odorant receptor 22c. 



Pyruvate kinase. Ml isozyme (EC 
2.7.1.40) (Pyruvat 

Separin (EC 3.4.22.-). 



Serotransferrin precursor (Transferrin) 

(Siderophi 

Serum albumin precursor. 



Shuttle craft protein. 
Skeletal muscle LIM protein 1 
Spindle pole body protein pcpl. 

I 

Superoxide dismutase [Cu-Zn] 

Thioredoxin. 

Triosephosphate isomer ase (EC 5.3.1.1) 

(TIM). 

Tropomyosin 1 alpha chain (Alpha- 
tropomyosin). 

Tropomyosin alpha 4 chain or beta 2 

chain 



Average H:L 

ratio 
Singleton L 

Singleton H 

0.90 

1.2 

NQ 

1.3 



SD 



Frequency 
1 



1.1 



1.1 



2.0 



Singleton H 



0.92 



0.87 



1.1 



2.3 



0.75 



0.31 



0.4 



0.4 



0.3 



0.33 



0.5 



0.7 



1 



1 



TBA__HUMAN 

(P05209) 

TBB^HUMAN 
(P07437 or 
P05217)) 

PTK7_HUMAN 
(Q 13308) 

SPK1_DUGTI 
(P42687) 

VSIl^TRYBB 

(P26326) 



Tubulin alpha chain. 



Tubulin beta chain (Fragment). 



Tyrosine-protein kinase like 7 precursor 

Tyrosine-protein kinase SPK-1 
(EC 2.7.1.112). 



1.2 



0.77 



0.35 



Singleton L 



Variant surface glycoprotein ILTAT 1.21 Singleton L 

precursor 



0.5 



0.25 



1 
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Accession # 

PGCV_MACNE 
(Q28858) 

VIME_HUMAN 
(P08670) 

VINC^HUMAN 
(PI 8206) 

VTDB__HUMAN 
CP02774) 

WDR1_HUMAN 
(075083) 

Z363_HUMAN 
(Q96PM5) 



Description 

Versican core protein 

Vimentin (Fragment). 

Vinculin (Metavinculin). 

Vitamin D-binding protein precursor 
(DBP) (Group-s 

WD-repeat protein 1 (Actin interacting 

protein 1) 

Zinc finger protein 363 



Average H:L 
ratio 
4.3 



1.6 



SD 



3.1 



1.3 



1.6 



Singleton L 



2.2 



0.4 



1.0 



Frequency 
1 



1 



10 



Note: Frequency is the number of sample pairs in which the protein is observed 
NQ = Not quantified 

Singleton L — Only observed in Proliferative phase 
Singleton H = Only observed in Secretory phase 

* : Subsequently identified by Mascot as Clostridium putative phosphatidylserine decarboxylase 
proenzyme (gi: 28209853) 

^ : Internal standard to which other proteins' peak areas are normalized 
° : Rickettsia conorii protein 
: Subsequently identified by Mascot as unlabeled serum albumin 
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' Table 7 

Protein abundances in nonmalignant endometrial tissue honiogenates. 



Case 


Histopathologic 
Classification 


Chaperonin 10 
(relative abundance)^ 


Second protein* 
(relative abundance) 


1 


Jr I UllJ.Cxa1.lVC 


1-1- 


n 
u 


2 


"Pit*! i^^tqIi 

JT 1 LI11J.C1 dLlVC 


1_L 
1~ 


u 


ri 

i 


oct>rciui jr 


14- 


A 


4 








5 


xjciiigii jjuijr^ 


0 


n 
u 


fi 
\j 


jnLLLUjJIliV 


1 + 


1 4- 


7 


O C vl C lL/1 y 


0 


1 4- 


8 
o 


I.-'IoVJl viCi CU jpiUlilCl dLL VC 


0 


n 

U 


9 


Si ecret firv 


0 


14- 

IT^ 


10 


rii QorH prpH Tirol i fi^Tsif i 


0 


14- 

i~ 


1 1 

i. L 


OCwl C LLll y 


0 


14- 


12 


Lj wL;X w LVJX Jf 


.X 1 






OCL/lCLUly / mcilollLlai 






1 A 


occrciory 


1-4- 


A 


1 < 


jrVllOpillt/ 


n 


14- 


16 


oc vix Ciux y 


11 




17 


Menstrual 


1+ 


1+ 


18 


Secretory 


1+ 


0 


19 


Proliferative 


n- 


0 


20 


Proliferative 


1+ 


0 


21 


Menstrual 


0 


0 


22 


Atrophic 


0 


1+ 


23 


Atrophic 


2+ 


1+ 



0 = absence, 1+ = S/N 2-3, 2+ = S/N 3-5, 3+ = S/N 5-7, 4+ == S/N 7-10, 5+ = S/N > 10. 
5 * calgranulin A. 

cervical cancer present several centimeters away. 
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Table 8 

Protein abundances in malignant endometrial tissue homogenates. 

Case Histopathologic Classification Chaperonin 10 Second protein^ 

(relative abundance)^ (relative abundance) 



24 


, Em AdCa 1/3 


2+ 


1+ 


25 


Em AdCa 1/3 


0 


5+ 


26 


MMMT 


5+ 


1+ 


27 


Em AdCa 


2+ 


5+ 


28 


MMMT 


1+ 


4+ 


29 


Em AdCa 2/3 


1+ 


3+ 


30 


Em AdCa 2/3 


1+ 


4+ 


. 31 


Ser AdCa • 


1+ 


3-{- 


32 


Em AdCa 1/3 


5+ 


2+ 


33 


Em AdCa Grade n.k. 


0 


0 


34 


Em AdCa Grade n.k. 


1+ 


3+ 


35 


Em AdCa Grade n.k. 


0 

1 


0 




Em AdCa 1/3 


5+ 


0 


37 


Em AdCa 1/3 


4+ 


2+ 


38 


Em AdCa 1/3 


3+ 


5+ 


39 , 


Muc AdCa 1/3 


2-H 


3+ 


40 


Em AdCa 1/3 


2+ 


2-1- 


41 


Em AdCa 1/3 


3+ 


1+ 


42 


Em AdCa - Ser AdCa 


5+ 


0 


43 


Em AdCa 1/3 


0 

* 


5+ 


44 


Em AdCa 2/3 


4-1- 


4+ 



t 0 = absence, + = S/N 2-3, 2+ = S/N 3-5, 3+ = S/N 5-7, 4+ = S/N 7-10, 5-h - S/N > 10. 
5 * calgranulin A. 

Em AdCa - Endometrioid adenocarcinoma 
MMMT = Malignant Mixed Mullerian Tumor 
Muc AdCa = Mucinous adenocarcinoma 
n.k. = not loiown 
10 Ser AdCa = Serous adenocarcinoma 
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Table 9 



Proteins in SEC fraction 7 of Endometrial Cancer Tissue IdentiHed by nanoLC/MS/MS 



10 



15 



20 



25 



30 



35 



Name 


Confidence^ 


Averaeed mol w£^ifT>i+ 


Calgranulin A 


99 




'Calgranulin B 


99 


13 242 


Heat shock protein 10 


99 


10 soo 

X vy, o \J\j 


Hemoglobin alpha chain 


90 


15 126 


Hemoglobin beta chain 


99 


15 867 


Putative nucleoside diphosphate kinase 


75 


15 529 


Nucleoside diphosphate kinase A 


75 


17 149 


Nucleoside diphosphate kinase B 


75 


17 298 


Cofilin, non-muscle isoform 


90 


18 502 


Cofilin, muscle isoform 


90 


18,736 


Translationally controlled tumor protein 


90 


19,595 


Peroxiredoxin 1 


99 


22 110 


Peroxiredoxin 2 


99 


21 892 


Neutrophil gelatinase-associated lipocalin 


99 


22,588 


Glutathione S-transferase P 


99 


■ 23,224 


Triosephosphate isomerase 


90 


26,538 


Peroxiredoxin 4 


90 


30,540 


NADPH dehydrogenase 


90 


30,867 


Ig gamma-3 chain C region 


75 


32,331 


Ig gamma-2 chain C region 


75 


35,884 


Ig gamma-4 chain C region 


75 


35,940 


Ig gamma- 1 chain C region 


75 


36,106 


C-myc promoter-binding protein 


99 


37,087 


Hetero. nuclear ribonucleoproteins A2 


99 


37,429 


Cathepsin B precursor 


99 


37,807 


Beta enolase 


75 


46,855 


Alpha enolase 


99 


47,037 


Gamma enolase 


75 


47,137 


Alpha enolase, lung specific 


99 


49,477 


Serum albumin precursor 


99 


69,366 


Polymeric- immunoglobulin receptor precursor 


75 


83,313 


Cone cGMP-specific 3',5'-cyclic phosphodiesterase 


75 


99,102 



^ "Confidence" value in ProID: the highest possible score is 99. 
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Table 10 



Immunohistochemical identification of calgranulinA in tissue microarray 
using two monoclonal mouse antibodies (RDI Sc USB) 

5 



Histologic Macrophage/ Epithelium/ Stroma/ 
ClassiHcation Granulocytic Carcinoma Myometrium 
RDI USB RDI USB RDI USB 

10 

Benign endometrium, 

n=10j: 0(1) 0(0) 0(9) 0(9) 0(10) 0(10) 

7+ (6) 1+(10) y+(l) 1+(1) i+(0) l+(0) 

2+ (3) 2+(0) 2+(0) 2+(0) 2+(0) 2+ (0) 

15 5+(0) 3+(0) 5+(0) 3+(0) 5+(0) 3+ (0) 

Atypical endometrial 

hyperplasia, n = 2 i+ (2) 1+ (2) i+ (2) l+(2) 0(2) 0(2) 

20 

EmCa,n=13t 0(1) 0(1) 0(6) 0(7) 0(12) 0(13) 

7+ (7) l+(8) 7+ (4) l+(2) i+(0) l+(0) 

2+ (3) 2+ (3) 24- (2) 2+ (2) 2+(0) 2+ (0) 

3+(l) 3-^(0) 3+ (2) 5+(0) 3+ (0) 

Non-endometrial 

carcinomas, n = 4 0(0) 0(1) 0(2) 0(2) 0(4) 0(4) 

7+(l) 1+(1) 7+(l) 1+(1) 7+(0) l+(0) 

2+ (3) 2+ (2) 2+(l) 2+(l) 2+(0) 2+ (0) 

30 3+(0) 3+(l) i+(0) 34-(0) 5+(0) 3+(0) 

Benign non-endometrial 

neoplasms, n = 2 0(0) 0(1) 0(0) 0(0) 0(0) 0(0) 

35 



25 



()~ Number of samples 
0 = No immunostaining 
7+ = Positivity in occasional or few cells 
40 2+ = Positivity in more abundant cells 
5+ = Strong positivity in numerous cells 
t = In TMA- RDI, one core of EmCa was not displayed 

J = 1 of 1 1 benign endometrial cores (secretory) did not display in either TMA. 

45 
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The present invention is not to be limited in scope by the specific embodiments described herein, since 
such embodiments are intended as but singLe illustrations of one aspect of the invention and any functionally 
equivalent embodiments are within the scope of this invention. Indeed, various modifications of the invention in 
5 addition to those shown and described herein will become apparent to those skilled in the art from the foregoing 
description and accompanying drawings. Such modifications are intended to fall within the scope of the 
appended claims. 

All publications, patents and patent applications referred to herein are incorporated by reference in their 
entirety to the same extent as if each individual publication, patent or patent application was specifically and 

10 individually indicated to be incorporated by reference in its entirety. All publications, patents and patent 
applications mentioned herein are incorporated herein by reference for the purpose of describing and disclosing 
the antibodies, melhodologies etc. which are reported therein which might be used in connection with the 
invention. Nothing herein is to be construed as an admission that the invention is not entitled to antedate such 
disclosure by virtue of prior invention. 

15 It must be noted tliat as used herein and in the appended claims, the singular forms "a", "an", and "the" 

include plural reference unless the context clearly dictates otherwise. Thus, for example, reference to "a cell" 
includes a plurality of such cells, reference to the "antibody" is a reference to one or more antibodies and 
equivalents thereof known to those skilled in the art, and so forth. 
Below full citations are set out for references. 

20 
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We Claim ; 

1. A method.for detecting one or more endometrial markers or polynucleotides encoding the markers 
associated with an endometrial disease or an endometrium phase in a subject comprising: 

5 (a) obtaining a sample from a subject; 

(b) detecting in proteins extracted from the sample one or more endometrial markers or 
polynucleotides encoding the markers that are associated with the disease or phase; and 

(c) comparing the detected amount with an amount detected for a standard. 

2. A method of detecting an endometrial disease in a subject, the method comprising comparing: 

10 (a) levels of one or more endometrial markers associated with endometrial disease that are 

extracted from a sample from the subject; and 

(b) normal levels of expression of the endometrial markers in a control sample, wherein a 
significant difference in levels of endometrial markers, relative to the corresponding normal 
levels, is indicative of endometrial disease. 

15 3. A method as claimed in claim 1 or 2 comprising: 

(a) contacting a biological sample obtained from a subject with one or more binding agent that 
specifically binds to the endometrial markers or parts thereof; and 

(b) detecting in the sample amounts of endometrial markers that bind to tlie binding agents, 
relative to a predetermined standard or cut-off value, and therefrom detennining the presence 

20 or absence of the endometrial disease in the subject. 

4. . A method as claimed in claim 3 wherein the binding agent is an antibody. 

5. A method for screening a subject for endometrial cancer comprising (a) obtaining a biological sample 
from a subject; (b) detecting in proteins extracted from the sample the amount of one or more 
endometrial cancer markers; and (c) comparing the amount of endometrial cancer markers detected to a 

25 ' predetermined standard, where detection of a level of endometrial cancer markers different than that of 

a standard is indicative of endometrial cancer. 

6. A method of claim 5 wherein the level of endometrial cancer markers are significantly higher compared 
to the standard and are indicative of endometrial cancer. 

7. A method of claim 5 wherein the level of endometrial cancer markers are significantly lower compared 
30 to the standard and are indicative of endometrial cancer. 

8. A method as claimed in any preceding claim wherein the sample is obtained from tissues, extracts, cell 
cultures, cell lysates, lavage fiuid, or physiological fluids. 

9. ' A method as claimed in claim 8 wherein the sample is obtained from a tumor tissue. 

10. A method as claimed in any preceding claim which further comprises detecting multiple cancer 
35 markers. 

11. A method for determining the presence or absence of endometrial markers associated with an 
endometrial disease in a subject comprising detecting one or more polynucleotide encoding an 
endometrial marker in a sample from the subject and relating the detected amount to the presence of an 
endometrial disease. 
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12. A method as claimed in claim 1 1 wherein the polynucleotide detected is mRNA. 

13. A method of claim 12 wherein the polynucleotide is detected by 

(a) contacting the sample with oligonucleotides that hybridize to the polynucleotides; and 

(b) detecting in the sample levels of nucleic acids that hybridize to the polynucleotides relative to 
5 a predetermined standard or cut-off value, and therefrom determining the presence or absence 

of an endometrial disease in the subject. 

14. A method as claimed in claim 12 wherein the mRNA is detected using an amplification reaction. 

15. A method as claimed in claim 14 wherein the amplification reaction is a polymerase chain reaction 
employing oKgonucleotide primers that hybridize to the polynucleotides, or complements of such 

10 polynucleotides. 

16. A method as claimed in claim 12 wherein the mRNA is detected using a hybridization technique 
employing oligonucleotide probes tliat hybridize to the polynucleotides or complements of such 
polynucleotides. 

17. A method as claimed in claim 14 wherein the mRNA is detected by (a) isolating mRNA from the 
15 sample and combining the mRNA with reagents to convert it to cDNA; (b) treating tlie converted 

cDNA with amplification reaction reagents and primers that hybridize to the polynucleotides, to 
produce amplification products; (d) analyzing the amplification products to detect an amount of mRNA 
encoding one or more endometrial markers; and (e) comparing the amount of mRNA to an amount 
detected against a panel of expected values for normal tissue derived using similar primers, 
20 18. A method for diagnosing and monitoring endometrial cancer in a subject comprising isolating nucleic 

acids in a sample from the subj ect; and detecting polynucleotides encoding endometrial cancer markers 
in the sample wherein the presence of higher or lower levels of polynucleotides encoding endometrial 
cancer markers in the sample compared to a standard or control is indicative of disease or prognosis. 

19. A method for monitoring the progression of endometrial cMicer in a subject, the method comprising: (a) 
25 detecting in a sample from the subject at a first time point, one or more endometrial cancer markers or 

polynucleotides encoding the markers; (b) repeating step (a) at a subsequent point in time; and (c) 
comparing levels detected in steps (a) and (b), and thereby monitoring the progression of endometrial 
cancer. 

20. A method for determining in a subject whether endometrial cancer has metastasized or is likely to 
30 metastasize in the future, the method comprising comparing (a) levels of one or more endometrial 

cancer markers or polynucleotides encoding the markers, in a subject sample; and (b) normal levels or 
non-metastatic levels of the endometrial cancer markers or polynucleotides encoding the markers, in a 
control sample wherein a significant difference between the levels of expression in the subject sample 
and the normal levels or non-metastatic levels is an indication that the endometrial cancer has 
35 metastasized. 

21. A method for assessing the aggressiveness or indolence of endometrial cancer comprising comparing: 
(a) levels of expression of one or more endometrial cancer markers or polynucleotides encoding the 
markers, in a subject sample; and (b) normal levels of expression of the endomefrial cancer markers or 
polynucleotides encoding the markers, in a control sample, wherein a significant difference between the 
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levels in the subject sample and normal levels is an indication that the cancer is aggressive or indolent. 
A diagnostic composition comprising an agent that binds to an endometrial canc^ marker or hybridizes 
to a polynucleotide encoding such marker. 

A method for assessing the potential efficacy of a test agent for inhibiting endometrial cancer in a 
subject, the method comprising comparing: (a) levels of one or more endometrial cancer markers, in a 
first sample obtained from a subject and exposed to the test agent, wherein the endometrial cancer 
markers, and (b) levels of the endometrial cancer markers in a second sample obtained from the subject, 
wherein the sample is not exposed to the test agent, wherein a significant difference in the levels of 
expression of the endometrial cancer markers in the first sample, relative to the second sample, is an 
indication that the test agent is potentially efficacious for inhibiting endometrial cancer in the subject. 
A method of assessing the efficacy of a therapy for inhibiting endometrial cancer in a subject, the 
method comprising comparing: (a) levels of one or more endometrial cancer markers in a first sample 
obtained from the subject; and (b) levels of the endometrial cancer markers in a second sample obtained 
from the subject following therapy, wherein a significant difference in the levels of expression of the 
endometrial cancer markers in the second sample, relative to the first sample, is an indication that the 
therapy is efficacious for inhibiting endometrial cancer in the subject. 

A method of selecting an agent for inhibiting endometrial cancer in a subject the method comprising (a) 
obtaining a sample comprising cancer cells from the subject; (b) separately exposing aliquots of the 
sample in the presence of a plurality of test agents; (c) comparing levels of one or more endometrial 
cancer markers in each of ^e aliquots; and (d) selecting one of the test agents which alters the levels of 
endometrial cancer markers in the aliquot containing that test agent, relative to other test agents. 
A method of inhibiting endometrial cancer in a subject, the method comprising (a) obtaining a sample 
comprising cancer cells from the subject; (b) separately maintaining aliquots of the sample in the 
presence of a plurality of test agents; (c) comparing levels of one or more endometrial cancer markers in 
each of tlie aliquots; and (d) administering to the subject at least one of the test agents which alters the 
levels of endometrial cancer markers in the aliquot containing that test agent, relative to other test 
agents. 

A method of assessing the endometrial cancer cell carcinogenic potential of a test compound, the 
method comprising: (a) maintaining separate aliquots of endometrial cancer cells in the presence and 
absence of the test compound; and (b) comparing expression of one or more endometrial cancer 
markers, in each of the ahquots, and wherein a significant difference in levels of endometrial cancer 
markers in the aliquot maintained in the presence of the test compound, relative to the aliquot 
maintained in the absence of the test compound, is an indication that the test compoimd possesses 
endometrial cancer cell carcinogenic potential. 
An in vivo method for imaging an endometrial disease comprising: 

(a) injecting a subject with one or more agent that binds to an endometrial marker, the agent 



(b) 
(c) 



carr)dng a label for imaging the endometrial marker; 

allowing the agent to incubate in vivo and bind to an endometrial marker; and 
detecting the presence of the label localized to diseased endometrial tissue. 
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29. A method as claimed in claim 28 wherein the agent is an antibody that specifically reacts with an 
endometrial marker. 

30. Markers that distinguish an endometrium phase or endometrial disease identified by assaying for 
differential expression of polypeptides in endometrium samples. 

5 31. Markers as claimed in claim 3 0 wherein differential expression is assayed using mass spectroscopy of 

polypeptides extracted from the samples. 

32. Markers of claim 3 1 which are up-regulated in endometrial cancer. 

33. Markers of claim 3 1 which are dbwn-regulated in endometrial cancer. 

34. A set of markers of claim 30 or 3 1 comprising a plurality of polypeptides comprising or consisting of at 
10 least 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, or 15 of the markers hsted in Table 1, 4, 5, or 6. 

35. A set of markers of claim 34 wherein the polypeptides are selected from the group consisting of 
polypeptides with the sequence ofSEQIDNOs. 1,3,6, 9, 11, 13, 15, 18,21, 23, 30,33, 36, 38, and 40. 

36. A set of markers of claim 31 wherein the polypeptides are selected from the group consisting of 
polypeptideswiththe sequence ofSEQIDNOs. 1,3, 6,9, 11,13, 15, 18,21,23,26,30, 33,36,38,40, 

15 42, 45, and 47. 

37. A set of markers of claim 31 wherein the polypeptides are selected from the group consisting of 
polypeptides with the sequence of SEQ ID NOs. 26, 42, 45, and 47. 

38. A method of any preceding claim wherein the endometrial markers are one or more of the polypeptides 
listed in Table 1 or they have a sequence ofSEQIDNOs. 1,3, 6, 9, 11, 13, 15, 18,21, 23, 26, 30, 33, 

20 36, 3 8, 40, 42, 45, and 47. 

39. A method of any preceding claim wherein the endometrial marker is chaperonin 10. 

40. A method of any preceding claim utilizing markers of claim 29, 30 or 3 1 or a set of markers of claim 34 
or 36. 

41. A method of determining uterine endometrial receptivity by first obtaining a serum, uterine fluid or 
25 endometrial biopsy sample from a subject and detecting the presence of an endometrial marker 

associated with a certain endometrium phase, wherein the presence or absence of an endometrial marker 
as compared to controls indicates uterine receptivity. 

42. A method of claim 41 wherein the endometrium phase is the secretory or proliferative phase. 

43 . A method of monitoring the effects of ovarian hyperstimulation and/or ovulation inductioii protocols on 
30 uterine receptivity which comprises: (a) obtaining a serum, uterine or fluid or endometrial biopsy 

sample from a subject receiving the treatments; and (b) detecting the presence of an endometrial marker 
present in the endometrium at the time of fertilization, early embryogenesis, and implantation; wherein 
presence or absence of an endometrial marker indicates receptivity. 

44. A method of determining a probability of successful implantation with an ovarian stimulation in vitro 
35 fertilization and embryo transfer procedure, comprising: 

(a) determining a level of an endometrial marker in a sample obtained from a subject who has 
undergone an ovarian stimulation in vitro fertilization and embryo transfer procedure; and 

(b) determining a probability of successful implantation based on the subject's determined 
endometrial marker level; 
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wherein a significantly different endometrial marker level relative to a standard level is associated with 
a decreased or increased probability of successful implantation. 

A method of any of claim 41 to 44 wherein the endometrial marker is glutamate receptor subunit zeta 1, 
a tryptic fragment thereof, and/or macrophage migration inhibitory factor. 

A method of contraception by interrupting the cyclic presence of an endometrial marker, in particular 
glutamate receptor subunit zeta 1, a tryptic fragment thereof, macrophage migration inhibitory factor, 
myosin light chain kinase 2, and/or tropomyosin 1 alpha chain. 
47. A kit for carrying out a method as claimed in any preceding claim. 

A kit for determining the presence of an endometrial disease in a subject, comprising a known amount 
of one or more binding agent that specifically binds to an endometrial marker wherein the binding agent 
comprises a detectable substance, or it binds directly or indirectly to a detectable substance. 
A kit for determining the presence of endometrial disease in a subject, comprising a known amount of 
an oligonucleotide that hybridizes to a polynucleotide encoding an endometrial marker wherein the 
oligonucleotide is directly or indirectly labeled with a detectable substance. 
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Sequence Listing 
SEQ ID NO. 1 



5 Q04984 and AAH23518 
Chaperonin 1 0 



10 



1 magqafrkfl plfdrvlver saaetvtkgg imlpeksqgk vlqatwavg sgskgkggei 
61 qpvsvkvgdk vllpeyggtk wlddkdyfl frdgdilgky vd 



SBQ ID NO. 2 



WM_002157 and U07550 
15 Human chaperonin 10 mRNA, complete cds 



1 gctacactag agcagagtac gagtctgagg cggagggagt aatggcagga caagcgttta 
61 gaaagtttct tccactcttt gaccgagtat tggttgaaag gagtgctgct gaaactgtaa 
121 ccaaaggagg cattatgctt ccagaaaaat ctcaaggaaa agtattgcaa gcaacagtag 
20 181 tcgctgttgg atcgggttct aaaggaaagg gtggagagat tcaaccagtt agcgtgaaag 
241 ttggagataa agttcttctc ccagaatatg gaggcaccaa agtagttcta gatgacaagg 
3 01 attatttcct atttagagat ggtgacattc ttggaaagta cgtagactga aataagtcac 
3 61 tattgaaatg gcatcaacat gatgctgccc attccactga agttctgaaa tctttcgtca 
421 tgtaaataat ttccatattt ctcttttata ataaactaat gataactaat gacatccagt 
25 481 gtctccaaaa ttgtttcctt gtactgatat aaacacttcc aaataaaaat atgtaaat 



SEQ ID NO. 3 



30 P05109 

Calgranulin A 

1 mltelekaln siidvyhkys likgnfhavy rddlkkllet ecpqyirkkg advwfkeldi 
61 ntdgavnfqe flilvikmgv aahkkshees hke 

35 



SEQ ID NO. 4 



A12027 

40 Macrophage migration inhibition factor {MRP-14) cDNA from Human 
placenta (formula v) 



1 cttgggttgc ttccaccttt tggctcttgt aaataatgct gctatgaaca tgaatgtaca 
61 aacatctgtt tgaatccctg cattcaattc ttttgcatat atacccagga gcagaatgat 

45 121 ggatcatatg gtaattctgt gtttatttat ttgaggaaca aacttgccgt tttccataac 
181 agctgcacta ttttacattc ccactaacag tgcattaggc ttccaattct ctatgccctc 
241 accaacactt gttttctggg ttttaaaaga agtagtagtc atccttgtag gtgtcaggtg 
301 gtatctcatt gtcgttttgc ttcatgtttt cctaaagatt agtaattttc atatgcttat 
361 tgaccatttg tatatcttct tcggagaagt gtctatttga gtctttcccc aattttgatt 

50 421 ggtttgtttg ttttttgttg ttgagttgta gggattcttt tatattctgg atattaatcc 
481 cttatcagat atttgtttta caaatatttt ctttgtaaca acagaaacac accacagtct 
541 tcaaggttgg aagccagtta atctgagtag cattttgtta gtggtgggga gaggatttgt 
601 tcctcctgaa atcctgggga attggccacc tcctcttctc ctcttaggca tgaagcgcgt 
661 ctggcttctc caaagaactc ttcccctcca ctacctcaga gttagcttcc tctcttcagc 

55 721 cagtgatcct ggggtcccag acacaataat taaccaagag agggtgaaag gctccctgct 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/061725 



PCT/CA2004/002172 



-2- 



25 



781 gtgtttatgc aatggctcag gcccttgtga agtgccgagg gaccccaagc agcctccatc 
841 tcccagggca tggtccatcc ccagctttca cagaacagga aagctgtgga ggagtgtggg 
901 cagcagggta ggaatggata tagcccttgg caacaacaca tttccccaca aagcacccac 
961 ccaaaagaac aacaacgata gttttagttt ttagtaatga gaacaatagt tctcatgact 
5 1021 aaaagccatc agccaggaca ctgttctcaa cccttttgcg gtctttggac cctttgaaac 
1081 tctgacagaa gccatggagg aatgttctca ctgagtgcat gcactcaaaa- tgatgcattc 
1141 aacttcaatt cagtttcagg gatgtatggc ctgaccacca atgcagggga ttagcaatcg 

12 01 caatagtgga gagggcatgg gagtgggaat ctggctggat caagcaagtg gatgccagca 
1261 gcccagaaaa agagcccccc tacctgcttt ttccttcctg ggcactattg cccagcaaat 

10 1321 gccttcctct ttccgcttct cctacctccc cacccaaaat tttcattctg cacagtgatt 

13 81 gccacattca ctggttgaga aacagagact gtagcaactc tggcagggag aagctgtctc 
1441 tgatggcctg aagctgtggg cagctggcca agcctaaccg ctataaaaag gagctgcctc 
15 01 tcagccGtgc atgtctcttg tcagctgtct tfccagaagac ctggtaagtg ggactgtctg 
1561 ggttggcccc gcactttggg cttctcttgg ggagggtcag ggaagtggag cagccttcct 

15 1621 gagagaggag agagaaagct cagggaggtc tggagcaaag atactcctgg aggtggggag 
1681 tgaggcaggg ataaggaagg agagtatcct ccagcacctt ccagtgggta agggcacatt 
1741 gtctcctagg ctggactttt cttgagcaga gggtggggtg gtaaggaaag tctacgggcc 
18 01 cccgtgtgtg tgcacatgtc tctgtgtgaa tggacccttc cccttcccac acgtgtatcc 
18 61 ctatcatccc acccttccca ccagaggcca tagccatctg ctggtttggt tatttgagag 

20 1921 tgcaggccag gacaaggcca tcgcttgggg catgaatcct ctgcgtactg ccctggccag 
1981 atgcaaattc cctgccatgg gattccccag aaggttctgt ttttcaggtg gggcaagttc 

2 041 cgtgggcatc atgttgaccg agctggagaa agccttgaac tctatcatcg acgtctacca 
2101 caagtactcc ctgataaagg ggaatttcca tgccgtctac agggatgacc tgaagaaatt 
2161 gctagagacc gagtgtcctc agtatatcag ggtgaggagg ggctgggtgt ggcgggggct 
2221 ctctgcctgg tcctggggct gccctgggcc agcggtcctc cctgccaccc ttcatagatg 
2281 ctatgcctcg gctctctctg agatctttaa actctggctt cttcctcctc aatcttgaca 
2341 gaaaaagggt gcagacgtct ggttcaaaga gttggatatc aacactgatg gtgcagttaa 
2401 cttccaggag ttcctcattc tggtgataaa gatgggcgtg gcagcccaca aaaaaagcca 
2461 tgaagaaagc cacaaagagt agctgagtta ctgggcccag aggctgggcc cctggacatg 
2521 tacctgcaga ataataaagt catcaatacc tcatgcctct ctcttatgct tttgtggaat 
2581 gaggttcctc ggtgtggagg gagggttgga aaacccaaag gaagaaaaag aaatctatgt 
2641 tatcccaccc tacctctcac aagcctttcc tgctttaccc ctcacctggc ctctgcccca 
2701 cattccttca gcccctcatt tcgagcattg gatttgaggc ttaaggattc aaaaagtcgt 
2761 catgaatata gctgatgatt ttatagtggt tctgaaatgg gtcggggatt tgggaacagg 
2821 gtggtagtat aagaacaact gatactgttc tctaagctaa atcttagctt ccagctacct 
2881 gtcttagatg tggctcttgg gaaccttaga gtgatagcta catagaagtg tgtgggtgtg 
2941 tgtgtgtgtg tctgtgtgtg tgtgtgtgag agagagacag acagaaagag agcaagagag 

3 0 01 ggaagggggg agaggctgat tgtgtgtgtg gtgtgatgta ggtggacaat gttcagagtc 
3061 ctccattaac aggataatcc tcacacctgt ccacatacct gtagtttgtc cttggggatt 

40 3121 ttgaaaattt ttcctccctc tccactccca aactcccaac tcaattaaat gataaaggaa 
3181 taggcaaata ggaaaataaa ttagtaaaac ttaagtcaaa gaataggtta ttcatacgct 
3241 gcctatggga ttctatgctt tgtgatcaga aaattatcta aaaaatactt cccaagggct 
3 3 01 ggtacaaggg aggccagaag acgagtggtt cttctctgag gtggacatta aaaaaagaag 
3361 aaaatgaagg ggaacctttt gacaagaatg tcaccccaaa ctggattttc atgctgtggt 

45 3421 gtggggaatt ttctgttgtc ctcacttagg tgctggggca gtggtgttag tgatgggtaa 
3481 aaaggtagga agctgtcaca gaatcactaa accagggttc ttaacttgtc tgtctataca 
3541 tctctgaaat tgggttgaag ttgtgtgcat cattttgagt gacgcactga gaacattcct 
3601 ccacggcttc catcgagagt ctcgaaaagg cccaacacct caaaaaggtt aagaacactt 
3661 gtcctgctta ctggttttta gtaacaaatg gcagagtatt tctctctgtc tctctctctt 

50 3721 tttttttttt tttttttgag acacagggtc ttgtctgtca cgtggactag agtacaatgg 
3781 gcatgatcat gggctcactg tagcctcgaa cacctgggct caagtaatcc tcccacctca 
3 841 gcctctttag tagctgggac tacagcatga gccactgccc ttggctaatt tttaaattat 
3 901 ttttttgtag agatggaaac ttgctatgtt gcccaggcta gtctcaaact cctggactca 
3 961 agcgatcctc ctaccttggc ctcccaaagt gctgagatta cagtgtgatc cacaccacac 

55 4 021 ctggccaaag attggagtat ttttattgct attgttgtgc tgggtgggtg ggtgggtgta 
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4081 tgctttgtgg ggacgtgtgt tgttgccaag ggctaaatca gttcctaccc tgctgcccac 
4141 agtcctccac agctttcctg ctctgtgaag ctaaggatac accccgatga taagctgtca 
4201 acata 



SEQ ID NO. 5 

Homo sapiens SlOO calcium binding protein A8 (calgranulin A) S100A8, 
10 mRNA 

1 atgtctcttg fccagcfcgtct ttcagaagac ctggtggggc aagtccgtgg gcatcatgtt 
61 gaccgagctg gagaaagcct tgaactctat catcgacgtc taccacaagt actccctgat 
121 aaaggggaat ttccatgccg tctacaggga tgacctgaag aaattgctag agaccgagtg 
15 181 tcctcagtat atcaggaaaa agggtgcaga cgtctggttc aaagagttgg atatcaacac 
241 tgatggtgca gttaacttcc aggagttcct cattctggtg ataaagatgg gcgtggcagc 
3 01 ccacaaaaaa agccatgaag aaagccacaa agagtagctg agttactggg cccagaggct 
361 gggcccctgg acatgtacct gcagaataat aaagtcatca atacctcaaa aaaaaaaaaa 
421 aaaaaaaa 

20 

SEQ ID NO. 6 

P06702 

25 Calgranulin B/MRP-14 

1 mtckmsqler nietiintfh qysvklghpd tlnqgefkel vrkdlqnflk kenJaiekvie 
61 himedldtna dkqlsfeefi mlmarltwas hekmhegdeg pglihhkpglg egtp 



30 



45 



SEQ ID NO. 7 



X06233 

Human mRNA for calcium-binding protein in macrophages (MRP -14) 
35 macrophage migration inhibitory factor (MIF) -related protein 

aaaacactct gtgtggctcc tcggctttga cagagtgcaa gacgatgact tgcaaaatgt 
cgcagctgga acgcaacata gagaccatca tcaacacctt ccaccaatac tctgtgaagc 
tggggcaccc agacaccctg aaccaggggg aattcaaaga gctggtgcga aaagatctgc 
40 aaaattttct caagaaggag aataagaatg aaaaggtcat agaacacatc atggaggacc 
tggacacaaa tgcagacaag cagctgagct tcgaggagtt catcatgctg atggcgaggc 
taacctgggc ctcccacgag aagatgcacg agggtgacga gggccctggc caccaccata 
agccaggcct cggggagggc accccctaag accacagtgg ccaagatcac agtggccacg 
gccacggcca cagtcatggt ggccacggcc acagccaccc at 



SEQ ID NO. 8 



M21064 

50 Human migration inhibitory factor-related protein 14 (MRP14) gene, 
complete cds 

1 atcactgtgg agtaggggaa gggcactcct ggggtggcaa ggtgggaggt gggccctgtg 

61 ttcccacagt gggcagggag gtagtgaaag ggaagctggc cggacaggaa gggccattcc 

55 121 aagagggctt tgtgcgcagg gctaagccaa gctttctcca taggcaatgg ggagcaactg 
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181 gaggttcgta gcaggagaag gacacatcaa gcccaccagg aggctaagta aaaacagttg 
241 tctcccaagt tataagttcc tggaaccctt gctgggagca ggatttagaa aaatgatgct 
3 01 gagagatgct agaaacatat tcgccctgag gctctctcac tcagactgca agaggaaggt 
3 61 atcatcagaa ttgcccttaa ccaggaacca gaatagctgg gtccccttcc tgccaagtca 
5 421 gcaaccagct atgtgacctt gctcaggtcc atctccgggt gtcagtttct tcatctacaa 
481 tgcaagaggg ttgcccacct ctgagaaccc ttctaacccc aaatctcacc ctatgaatct 
541 aagaacacaa cccctcgcca tcctaagtat cacagagcca ggcaagcatg ggtgagagct 
601 cagaccatcc ttgttggact aaaaggaagg ggcagactgc catggggggc agccgagagg 
661 gtcaggccGc cataggtcct cagcctgctt caacctcaaa ggggatgggg ggctgagtgg 
10 721 tgccagagga gcagcaggct cgctcgggga gagtagggcc ttaggataga agggaaatga 
781 actaaacaac cagcttcctg caaaccagtt tcaggccagg gctgggaatt tcacaaaaaa 
841 gcagaaggcg ctctgtgaac atttcctgcc ccgccccagc ccccttcctg gcagcattag 
901 cacactgctc acctgtgaag caatcttccg gagacagggc caaagggcaa gtgccccagt 
961 caggagctgc ctataaatgc cgagcctgca cagctctggc aaacactctg tgtggctcct 
15 1021 cggctttggt aagtgagctg ccagcttccc caggcagaag cctgcctgcc gattccttct 
1081 ttccttccct gacccaactt ccttccaaat cctcctccta gaagccctcc ttggttggcc 
1141 ctgcctactt taaagcttct ttcacatttt cttaggtcat gttcccctgg ggcctcctgc 
1201 Gctcaaatgc tttgcttttt ggcactctgt agatattcta aaaaatcatt ttgtacatgt 
1261 gtgtgacagg ccatctccca gttaagttgc agcctgtgct ttctttttat tttgcacttc 
20 1321 ccccactatt tctgtgagtg cttagtagga agtgtcaaag aagcttgaca gcattttctt 
1381 ctaagtgtcc caactcttgg ttttccatta cacagacaga gtgcaagacg atgacttgca 
1441 aaatgtcgca gctggaacgc aacatagaga ccatcatcaa caccttccac caatactctg 
1501 tgaagctggg gcacccagac accctgaacc agggggaatt caaagagctg gtgcgaaaag 
1561 atctgcaaaa ttttctcaag gtagggctgg actctggcag gtctgaccca gcctcaccgc 
25 1621 agtttgggtt gacaagggag gatgggagta tgggctacag caatcaaggg gaagatttga 
1681 gctcctggag cccagcccca agacgcagcg agtgtcctgt tatacagggc aggtgctcac 
1741 agttacacag gacgacaggg tcaagaaatt gctcaattga acacctgcta tttgtcgggc 
1801 cctgttctgg gcagagggat gtagtggtaa atgggagccc actattccat gaggagacac 
1861 acagtaaagt tgttggccaa taaagagcac agataaagcc aaatgccaat aagtgcctgg 
30 1921 aagaaaatga gatagagtgc gctgtgggca atggggctgg gtggggtgga ggtgaccagt 
1981 tagggtacat gagaagggcc tctttgagga ggtaacattt gagctgagcc ccgaatgttg 

2 041 gggagggaag cccctgagga tgacacttgg cacaaagctg aggagaccct aagcctcagg 
2101 gcgaacttgg ggtggaagac ttgggggctt ttctaatcct aagggtctgc ggtggaaaat 
2161 gaatgcataa agagcacatg gagagcacct gcacagcact cagggaactg ggaggttttt 

35 2221 cccccgctcG aaaaatgatt aggcagttct aagaaaaagg ctgagcactt ccaacagcct 
2281 ttttgttttc ttttcaaatt tggggaaagt cgggaaacag aggccfcgcat taagaagggt 
2341 ggaacacatg ggtctcagtc tcagttccag tcccggagcc agacatcctg gggtaggtcc 
24 01 ccagccctcc cagtgcccct ccctccgcct tggtaaggtg gagaattgca gccttcagag 
2461 ttaggggccc tgacagctct ccataggtgg aggcctcagg caggcaggat gctgggtggg 

40 2521 gtaggcaaga aagggcccag cagagaggcc gcatcggaaa actatcctcc atgtgacccc 
2581 ctatgcccgc ttcacccccc acctgacatc ccccaccaga agcaaagcga tgctgtggga 
2641 aaggaagcag agcctcatgg atgggctgca caggagagtg ctcgcattgg ctgggtaccc 
2701 cacaggttct gggaggggac ttagcgaggt gactcagtgc ctcggcctcc caaagtgctg 
2761 ggattacaag catgagccac cctgtccgac catctcccct tttatacttt atcacaccct 

45 2821 tgaggtcagc ggagcacata ctctgctctc tgaccctcca tctcccctgc ccacacctag 
2881 gtttttctag tgtttccccg ttgtattggt tgaaataagt ttcactaatt ggtaacctcc 
2941 agagggaagg gaagggaggg caggggaagg agtgaagtgc agaggggtag cagagtggaa 

3 0 01 ctggcctcta agtcagatct gaatttgcat gccctcaata gtcaagcctg tgaaaactaa 
3061 tgaccctctc taggactggt ttcaagtctt cctccaggaa gataccattc ctagctgtta 

50 3121 aagttgttat aaggaccaaa tgaggtgaca tttccaggct tactoatgcc atgaccaggg 
3181 caagaccctg gaactcagct tcctcttcta taaatagaga atcagcaccc aagtcacagg 
3241 gtcatggagg gaataaactg gagagcgttt ggtatgtgct cagtgtctgc tccattgtgc 
33 01 gcactcagcc tatggtcatt tttaattttt aaatccagcc ccagggtcga ggcttccttg 
3361 tacatttgcc agctggtcat tfcacfcgtgct cccagtcccc acctcfcggcc acacccagct 

55 3 421 ctcacagcct tctctcccca cccgcagaag gagaataaga atgaaaaggt catagaacac 
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3481 atcatggagg acctggacac aaatgcagac aagcagctga gcttcgagga gttcatcatg 
3541 ctgatggcga ggctaacctg ggcctcccac gagaagatgc acgagggtga cgagggccct 
3601 ggccaccacc ataagccagg octcggggag ggcaccccct aagaccacag tggccaagat 
3 661 cacagtggcc acggccacgg ccacagtcat ggtggccacg gccacaggcc actaatcagg 
5 3721 aggccaggcc accctgcctc tacccaacca gggccccggg gctgttatgt caaactgtct 
3781 tggctgtggg gctaggggct ggggcaaata agtctcttcc tccaagtcag tgctctgtgt 
3841 gcttcttcca cctcttctcc aaccctgcct tcccagggct ctggcattta gacagccctg 
3901 fcccttatctg tgactcagcc ccctcattca gtattaacaa aatgagaagc agcaaaacat 
3961 gggtctgtgc tgggcccctt ggctcacctc cctgaccatg tcctcaccfcc tgacttcagg 

10 4021 CGCcactgtt cagatcccag gctccctgcc ccatctcaga caccctgtcc agcctgtcca 
4081 gcctgacaaa tggcccttgt cactgtacac tgtagaaagc aaaaaggcafc atctctaccc 
4141 cttgatatgc ctgctacctc accaaccagc cccaagcctg tcttcaccca tcactgtcta 
4201 cacagccctc tctctctcct aacagaattc tattcctctg aaagtcttca gaaactggac 
4261 ctagatagtg ccatgtctgg ggaggaatat ggcaccaggc agtggaaaca aggacagatc 

15 4321 ggtgtgttat ctcacatttg atcagagagc atgatctctc ttaacagacc tgccacccta 
43 81 atcaacggga gtgctcacac aagtgggagt ctgagagctt agccctatgc ccaccctgg 



SEQ ID NO. 9 

20 

P01833 and Q81ZY7 

Polymeric- immunoglobulin receptor (precursor) 

1 mllfvltcll avfpaistks pifgpeevns vegnsvsitc yypptsvnrh trkywcrqga 
25 61 rggcitliss egyvsskyag ranltnfpen gtfwniaql sqddsgrykc glginsrgls 
121 fdvslevsqg pgllndtkvy tvdlgrtvti ncpfktenaq krkslykqig lypvlvidss 
181 gyvnpnytgr irldiqgtgg llfswinql rlsdagqylc qagddsnsnk knadlqvlkp 
241 epelvyedlr gsvtfhcalg pevanvakfl crqssgencd wvntlgkra pafegrilln 
3 01 pqdkdgsfsv vitglrkeda grylcgahsd gqlqegspiq awql£vnees tiprsptwk 
30 361 gvagssvavl cpynrkesks ikywclwega qngrcpllvd segvTvkaqye grlslleepg 
421 ngtftvilnq Itsrdagfyw cltngdtlwr ttveikiieg epnlkvpgnv tavlgetlkv 
481 pchfpckfss yekywckwnn tgcqalpsqd egpskafvnc densrlvslt Inlvtradeg 
541 vjywcgvkqgh fygetaavyv aveerkaags rdvslakada apdekvldsg freienkaiq 
6 01 dprlfaeeka vadtrdqadg srasvdsgss eeqggssral vstlvplglv lavgavavgv 
35 661 ararhrknvd rvsirsyrtd ismsdfensr efgandnmga ssitqetslg gkeefvatte 
721 sttetkepkk akrsskeeae maykdfllqs stvaaeaqdg pqea 



SEQ XD NO. 10 

40 

NM_002644 

Homo sapiens polymeric immunoglobulin receptor (PIGR) , mRNA 

1 agagtttcag .ttttggcagc agcgtccagt gccctgccag tagctcctag agaggcaggg 

45 61 gttaccaact ggccagcagg ctgtgtccct gaagtcagat caacgggaga gaaggaagtg 

121 gctaaaacat tgcacaggag aagtcggcct gagtggtgcg gcgctcggga cccaccagca 

181 atgctgctct tcgtgctcac ctgcctgctg gcggtcttcc cagccatcfcc cacgaagagt 

241 cccatatttg gtcccgagga ggtgaatagt gtggaaggta actcagtgtc catcacgtgc 

3 01 tactacccac ccacctctgt caaccggcac acccggaagt actggtgccg gcagggagct 

50 361 agaggtggct gcataaccct catctcctcg gagggctacg tctccagcaa atatgcaggc 

421 agggctaacG tcaccaactt cccggagaac ggcacatttg tggtgaacat tgcccagctg 

481 agccaggatg actccgggcg ctacaagtgt ggcctgggca tcaatagccg aggcctgtcc 

541 tttgatgtca gcctggaggt cagccagggt cctgggctcc taaatgacac taaagtctac 

601 acagtggacc tgggcagaac ggtgaccatc aacfcgccctt tcaagactga gaatgctcaa 

55 661 aagaggaagt ccttgtacaa gcagataggc ctgtaccctg tgctggtcat cgactccagt 
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721 ggttatgtaa atcccaacta tacaggaaga atacgccttg atattcaggg tactggccag 
781 ttactgttca gcgttgtcat caaccaactc aggctcagcg atgctgggca gtatctctgc 
841 caggctgggg atgattccaa tagtaataag aagaatgctg acctccaagt gctaaagccc 
901 gagcccgagc tggtttatga agacctgagg ggctcagtga ccttccactg tgccctgggc 
5 961 cctgaggtgg caaacgtggc caaatttctg tgccgacaga gcagtgggga aaactgtgac 
1021 gtggtcgtca acaccctggg gaagagggcc ccagcctttg agggcaggat cctgctcaac 
1081 ccccaggaca aggatggctc attcagtgtg gtgatcacag gcctgaggaa ggaggatgca 
1141 gggcgctacc tgtgtggagc ccattcggat ggtcagctgc aggaaggctc gcctatccag 
1201 gcctggcaac tcttcgtcaa tgaggagtcc acgattcccc gcagccccac tgtggtgaag 
10 12 61 ggggtggcag gaggctctgt ggccgtgctc tgcccctaca accgtaagga aagcaaaagc 
1321 atcaagtact ggtgtctctg ggaaggggcc cagaatggcc gctgccccct gctggtggac 
13 81 agcgaggggt gggttaaggc ccagtacgag ggccgcctct ccctgctgga ggagccaggc 
1441 aacggcacct tcactgtcat cctcaaccag ctcaccagcc gggacgccgg cttctactgg 
15 01 tgtctgacca acggcgatac tctctggagg accaccgtgg agatcaagat tatcgaagga 
15 15 61 gaaccaaacc tcaaggtacc agggaatgtc acggctgtgc tgggagagac tctcaaggtc 
1621 ccctgtcact ttccatgcaa attctcctcg tacgagaaat actggtgcaa gtggaataac 
1681 acgggctgcc aggccctgcc cagccaagac gaaggcccca gcaaggcctt cgtgaactgt 
1741 gacgagaaca gccggcttgt ctccctgacc ctgaacctgg tgaccagggc tgatgagggc 
18 01 tggtactggt gtggagtgaa gcagggccac ttctatggag agactgcagc cgtctatgtg 
20 1861 gcagttgaag agaggaaggc agcggggtcc cgcgatgtca gcctagcgaa ggcagacgct 
1921 gctcctgatg agaaggtgct agactctggt tttcgggaga ttgagaacaa agccattcag 
1981 gatcccaggc ttttfcgcaga ggaaaaggcg gtggcagata caagagatca agccgatggg 

2 041 agcagagcat ctgtggattc cggcagctct gaggaacaag gtggaagctc cagagcgctg 
2101 gtctccaccc tggtgcccct gggcctggtg ctggcagtgg gagccgtggc tgtgggggtg 

25 2161 gccagagccc ggcacaggaa gaacgtcgac cgagtttcaa tcagaagcta caggacagac 
2221 attagcatgt cagacttcga gaactccagg gaatttggag ccaatgacaa catgggagcc 
22 81 tcttcgatca ctcaggagac atccctcgga ggaaaagaag agtttgttgc caccactgag 
2341 agcaccacag agaccaaaga acccaagaag gcaaaaaggt catccaagga ggaagccgag 
24 01 atggcctaca aagacttcct gctccagtcc agcaccgtgg ccgccgaggc ccaggacggc 

30 2461 ccccaggaag cctagacggt gtcgccgcct gctccctgca cccatgacaa tcaccttcag 
2521 aatcatgtcg atcctggggc cctcagctcc tggggacccc actccctgct ctaacacctg 
2581 cctaggtttt tcctactgtc ctcagaggcg tgctggtccc ctcctcagtg acatcaaagc 
2641 ctggcctaat tgttcctatt ggggatgagg gtggcatgag gaggtcccac ttgcaacttc 
2701 tttctgttga gagaacctca ggtacggaga agaatagagg tcctcatggg tcccttgaag 

35 2761 gaagagggac cagggtggga gagctgattg cagaaaggag agacgtgcag cgcccctctg 
2821 cacccttatc atgggatgtc aacagaattt ttccctccac tccatccctc cctcccgtcc 
2881 ttcccctctt cttctttcct tccatcaaaa gatgtatttg aattcatact agaattcagg 
2941 tgctttgcta gatgcfcgtga caggtatgcc accaacactg ctcacagcct ttctgaggac 

3 001 accagtgaaa gaagccacag ctcttcttgg cgtatttata ctcactgagt cttaactttt 
40 3 061 caccaggggt gctcacctct gcccctattg ggagaggtca taaaatgtct cgagtcctaa 

3121 ggccttaggg gtcatgtatg atgagcatac acacaggtaa ttataaaccc acattcttac 
3181 catttaacac ataagaaaat tgaggtttgg aagagtgaag cgtttttctt tttctttttt 
3241 ttttttgaga cggagtctct cactgtcgcc caggctggag tgcagtggcg caatctcggc 
33 01 tcactgcaac ctccgcctcc caggttgaca ccattctcct gcctcaccct cccaagtagc 

45 33 61 tgggactaca ggcgcctgcc agcacgcctg gctaattttt tgtattttta gtagagacag 
3421 ggtttcaccg tgttagccag gatggtctcg atctcctgac ctcgtgatcc gcctgcctct 
3481 gcctcccaaa gtgctgggat tacaggcgtg agccaccgcg tccggcctct ttttttcttt 
3541 tctttttttt gagacaaagt ctcactgtgt cacccagact ggaatgcagt gacacaatct 
3601 cggctcactg aaacctctgc cttccaggtt caagctattc tcatgcctca gcctctcaag 

50 3661 tagctgggac tacagatgtg ggccaccatg tctggctaat tttttttttt tttttttttt 
3 721 tttgtagaga cagggtttcg ccatgttgac gagactggtc tcgaactcct ggcctcaagt 
3 781 gatctgccgc ctcagcttct caaagtactg ggattatata ggcatgagcc actgagcctg 
3 841 gcGctgaagc gtttttctca aaggccctca gtgagataaa ttagatttgg catctcctgt 
3901 cctgggccag ggatctctct acaagagccc cfcgcccctcfc gttggaggca cagttfctaga 

55 3 961 ataaggagga ggagggagaa gagaaaatgt aaaggaggga gatctttccc aggccgcacc 
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4021 atttctgtca ctcacatgga cccaagataa aagaatggcc aaaccctcac aacccctgat 
4 0 81 gtttgaagag ttccaagttg aagggaaaca aagaagtgtt tgatggtgcc agagaggggc 
4141 tgctctccag aaagctaaaa tfctaatttct tttttcctct gagttcfcgta cttcaaccag 
42 01 cctacaagct ggcacttgct aacaaatcag aaatatgaca attaatgatt aaagactgtg 
5 4261 attgcc 



SEQ ID NO- 11 



10 P3 008 6 " Homo sapiens 

Phosphatidylethanolamine binding protein (PEBP) 

1 mpvdlskwsg plslgevdeq pqhplhvtya gaavdelgkv Itptqvknrp tsiswdglds 
61 gklytlvltd pdapsrkdpk yrewhhflvv nmkgndissg tvlsdyvgsg ppkgtglhry 
15 121 vwlvyeqdrp Ikcdepilsn rsgdhrgkfk vasfrkkyel rapvagtcyq aewddyvpkl 
181 yeqlsgk 



SEQ ID NO. 12 

20 

NM_002567 

Homo sapiens prostatic binding protein (PBP) , mRNA 

1 tgggcggcgg ctgaggcgcg tgctctcgcg tggtcgctgg gtctgcgtct tcccgagcca 

25 61 gtgtgctgag ctctccgcgt cgcctctgtc gcccgcgcct ggcctaccgc ggcactcccg 

121 gctgcacgct ctgcttggcc tcgccahgcc ggtggacctc agcaagtggt ccgggccctt 

181 gagcctgcaa gaagtggacg agcagccgca gcacccgctg catgtcacct acgccggggc 

241 ggcggtggac gagctgggca aagtgctgac gcccacccag gttaagaata gacccaccag 

3 01 catttcgtgg gatggtcttg attcagggaa gctctacacc ttggtcctga cagacccgga 
30 3 61 tgctcGcagc aggaaggatc ccaaatacag agaatggcat catttcctgg tggtcaacat 

421 gaagggcaat gacatcagca gtggcacagt cctctccgat tatgtgggct cggggcctcc 

4 81 caagggcaca ggcctccacc gctatgtctg gctggtttac gagcaggaca ggccgctaaa 
541 gtgtgacgag cccatcctca gcaaccgatc tggagaccac cgtggcaaat tcaaggtggc 
601 gtccttccgt aaaaagtatg agctcagggc cccggtggct ggcacgtgtt accaggccga 

35 661 gtgggatgac tatgtgccca aactgtacga gcagctgtct gggaagtagg gggttagctt 

721 ggggacctga actgtcctgg aggccccaag ccatgttccc cagttcagtg ttgcatgtat 

781 aatagatttc tcctcttcct gccccccttg gcatgggtga gacctgacca gtcagatggt 

841 agttgagggt gacttttcct gctgcctggc ctttataatt ttactcactc actctgattt 

901 atgttttgat caaatttgaa cttcattttg gggggtattt tggtactgtg atggggtcat 

40 961 caaattatta atctgaaaat agcaacccag aatgtaaaaa agaaaaaact ggggggaaaa 

1021 agaccaggtc tacagtgata gagcaaagca tcaaagaatc tttaagggag gtttaaaaaa 

1081 aaaaaaaaaa aaaaagattg gttgcctctg cctttgtgat 'cctgagtcca gaatggtaca 

1141 caatgtgatt ttatggtgat gtcactcacc tagacaacca gaggctggca ttgaggctaa 

1201 cctccaacac agtgcatctc agatgcctca gtaggcatca gtatgtcact ctggtccctt 

45 1261 taaagagcaa tcctggaaga agcaggaggg agggtggctt tgctgttgtt gggacatggc 

13 21 aatctagacc ggtagcagcg ctcgctgaca gcttgggagg aaacctgaga tctgtgtttt 

1381 ttaaattgat cgttcttcat gggggtaaga aaagctggtc tggagttgct gaatgttgca 

1441 ttaattgtgc tgtttgcttg tagttgaata aaaatagaaa cctgaatgaa gaaaaaaaaa 

15 01 aaaaaaa 

50 



SEQ ID NO. 13 



P39687 ~ Homo sapiens 
55 Acidic leucine-rich nuclear phosphoprotein 32 family member A 
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1 memgrritile Irnrtpsdvk elvldnsrsn egklegltde feeleflsti nvgltsianl 
61 pklnklkkle Isdnrvsggl evlaekcpnl thlnlsgnki kdlstieplk klenlksldl 
121 fncevtnlnd yrenvfkllp qltyldgydr ddkeapdsda egyvegldde eededeeeyd 
181 edaqwedee dedeeeegee edvsgeeeed eegyndgevd deedeeelge eergqkrkare 

241 pedegeddd 



SEQ ID NO. 14 

NM_006305 

Homo sapiens acidic (leucine-rich) nuclear phosphoprotein 32 
family, member A (ANP32A) , mJRNA 

15 1 cgggtgctgg gggctcgaga accgagcgga gctggttgag ccttcaaagt cctaaaacgc 

61 gcggccgtgg gttcggggtt tattgattga attccgccgg cgcgggagcc tctgcagaga 

121 gagagcgcga gagatggaga tgggcagacg gattcattta gagctgcgga acaggacgcc 

181 ctctgatgtg aaagaacttg tcctggacaa cagt cggtcg aatgaaggca aactcgaagg 

241 cctcacagat gaatttgaag aactggaatt cttaagtaca atcaacgtag gcctcacctc 

20 301 aatcgcaaac ttaccaaagt taaacaaact taagaagctt gaactaagcg ataacagagt 

361 ctcagggggc ctggaagtat tggcagaaaa gtgtccgaac ctcacgcatc taaatttaag 

421 tggcaacaaa afctaaagacc tcagcacaat agagccactg aaaaagttag aaaacctcaa 

481 gagcttagac cttttcaatt gcgaggtaac caacctgaac gactaccgag aaaatgtgtt 

541 caagctcctc ccgcaactca catatctcga cggctatgac cgggacgaca aggaggcccc 

25 601 tgactcggat gctgagggct acgtggaggg cctggatgat gaggaggagg atgaggatga 

661 ggaggagtat gatgaagatg ctcaggtagt ggaagacgag gaggacgagg atgaggagga 

721 ggaaggtgaa gaggaggacg tgagtggaga ggaggaggag gatgaagaag gttataacga 

781 tggagaggta gatgacgagg aagatgaaga agagcttggt gaagaagaaa ggggtcagaa 

841 gcgaaaacga gaacctgaag atgagggaga agatgatgac taagtggaat aacctatttt 

30 901 gaaaaattcc tattgtgatt tgactgtttt tacccatatc ccctctcccc cccccctcca 

961 atccfcgcccc ctgaaactta tttttttctg attgtaacgt tgctgtggga acgagagggg 

1021 aagagtgtac tgggggttgc ggggggaggg atggcgggtg ggggtggaat aaaatactat 

1081 ttttactgcc actctttaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 



35 



40 



55 



SEQ ID NO. 15 

P17066 - Homo sapiens 
Heat shock 7 0kDa protein 



1 mqaprelavg idlgttyscv gvfqggrvei landqgnrtt psyvaftdte rlvgdaaksq 
61 aalnphntvf dakrligrkf adttvqsdmk h.wpfrwseg gkpkvrvcyr gedktfypee 
121 issmvlskrak etaeaylgqp vkhavitvpa yfndsqrqat kdagaiagln vlriinepta 
181 aaiaygldrr gagernvlif dlgggtfdvs vlsidagvfe vkatagdthl ggedfdnrlv 
45 241 nlifmeefrrk hgkdlsgnkr alrrlrtace rakrtlssst qatleidslf egvdfytsit 
3 01 rarfeelcsd Ifrstlepve kalrdakldk aqihdwlvg gstripkvqk llqdffngke 

3 61 Inksinpdea vaygaavqaa vlmgdkcekv qdlllldvap Islgletagg vmttliqrna 
421 tiptkqtqtf ttysdnqpgv fiqvyegera mtkdnnllgr felsgippap rgvpqievtf 

4 81 didangilsv tatdrstgka nkititndkg rlskeeverm vheaeqykae deaqrdrvaa 
50 541 knsleahvfh vkgslqeesl rdkipeedrr kmqdkcrevl awlehnqlae keeyehqkre 

6 01 leqicrpifs rlyggpgvpg gsscgtqarg gdpstgpiie evd 
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NM_002155 

Homo sapiens heat shock 70kDa protein 6 (HSP70B') (HSPA6) , raRNA. 

1 agagccagcc cggaggagct agaaccttcc ccgcatttct ttcagcagcc tgagtcagag 
5 61 gcgggctggc ctggcgtagc cgcccagcct cgcggctcat gccccgatct gcccgaacct 

121 tctcccgggg tcagcgccgc gccgcgccac ccggctgagt cagcccgggc gggcgagagg 
181 ctctcaactg ggcgggaagg tgcgggaagg tgcggaaagg ttcgcgaaag ttcgcggcgg 
241 cgggggtcgg gtgaggcgca aaaggataaa aagcccgtgg aagcggagct gagcagatcc 
301 gagccgggct ggctgcagag aaaccgcagg gagagcctca ctgctgagcg cccctcgacg 
10 361 gcggagcggc agcagcctcc gtggcctcca gcatccgaca agaagcttca gccatgcagg 
421 ccccacggga gctcgcggtg ggcatcgacc tgggcaccac ctactcgtgc gtgggcgtgt 
481 ttcagcaggg ccgcgtggag atcctggcca acgaccaggg caaccgcacc acgcccagct 
541 acgtggcctt caccgacacc gagcggctgg tcggggacgc ggccaagagc caggcggccc 
601 tgaacGccca caacaccgtg ttcgatgcca agcggctgat cgggcgcaag ttcgcggaca 
15 661 ccacggfcgca gtcggacatg aagcactggc ccttccgggt ggtgagcgag ggcggcaagc 
721 ccaaggtgcg cgtatgctac cgcggggagg acaagacgtt ctaccccgag gagatctcgt 
781 ccatggtgct gagcaagatg aaggagacgg ccgaggcgta cctgggccag cccgtgaagc 
841 acgcagtgat caccgtgccc gcctatttca atgactcgca gcgccaggcc accaaggacg 
901 cgggggccat cgcggggctc aacgtgttgc ggatcatcaa tgagcccacg gcagctgcca 
20 961 tcgcctatgg gctggaccgg cggggcgcgg gagagcgcaa cgtgctcatt tttgacctgg 
1021 gtgggggcac cttcgatgtg tcggttctct ccattgacgc tggtgtcttt gaggtgaaag 
1081 ccacfcgctgg agatacccac ctgggaggag aggacttcga caaccggctc gtgaaccact 
1141 tcatggaaga attccggcgg aagcatggga aggacctgag cgggaacaag cgtgccctgc 
1201 gcaggctgcg cacagcctgt gagcgcgcca agcgcaccct gtcctccagc acccaggcca 
25 1261 ccctggagat agactccctg ttcgagggcg tggacttcta cacgtccatc actcgtgccc 
1321 gctttgagga actgtgctca gacctcttcc gcagcaccct ggagccggtg gagaaggccc 
1381 tgcgggatgc caagctggac aaggcccaga ttcatgacgt cgtcctggtg gggggctcca 
1441 ctcgcatccc caaggtgcag aagttgctgc aggacttctt caacggcaag gagctgaaca 
1501 agagcatcaa ccctgatgag gctgtggcct atggggctgc tgtgcaggcg gccgtgttga 
30 1561 tgggggacaa atgtgagaaa gtgcaggatc tcctgctgct ggatgtggct cccctgtctc 
1621 tggggctgga gacagcaggt ggggtgatga ccacgctgat ccagaggaac gccactatcc 
1681 ccaccaagca gacccagact ttcaccacct actcggacaa ccagcctggg gtcttcatcc 
1741 aggtgtatga gggtgagagg gccatgacca aggacaacaa cctgctgggg cgttttgaac 
1801 tcagtggcat ccctcctgcc ccacgtggag tcccccagat agaggtgacc tttgacattg 
35 1861 atgctaatgg catcctgagc gtgacagcca ctgacaggag cacaggtaag gctaacaaga 
1921 tcaccatcac caatgacaag ggccggctga gcaaggagga ggtggagagg atggttcatg 
1981 aagccgagca gtacaaggct gaggatgagg cccagaggga cagagtggct gccaaaaact 
2 041 cgctggaggc ccatgtcttc catgtgaaag gttctttgca agaggaaagc cttagggaca 
2101 agattcccga agaggacagg cgcaaaatgc aagacaagtg tcgggaagtc cttgcctggc 
40 2161 tggagcacaa ccagctggca gagaaggagg agtatgagca tcagaagagg gagctggagc 
2221 aaatctgtcg ccccatcttc tccaggctct atggggggcc tggtgtccct gggggcagca 
2281 gttgtggcac tcaagcccgc cagggggacc ccagcaccgg ccccatcafct gaggaggttg 
2341 attgaatggc ccttcgtgat aagtcagctg tgactgtcag ggctatgcta tgggccttct 
2401 agactgtctt ctatgatcct gcccttcaga gatgaacttt ccctccaaag ctagaacttt 
45 2461 Gttcccagga taactgaagt cttttgactt tttgggggga gggcggttca tcctcttctg 
2521 cttcaaataa aaagtcatta atttattaaa acttgtgtgg cactttaaca ttgctttcac 
2581 ctatattttg tgtactttgt tacttgcatg tatgaatttt gttatgtaaa atatagttat 
2 641 agacctaaat aaaaaaaaaa aaaa 



50 



55 



SEQ ID NO. 17 

X51757 

Human heat -shock protein HSP70B gene 
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1 cccgggcggg cgagaggctc tcaactgggc gggaaggtgc gggaaggtgc ggaaaggttc 
61 gcgaaagttc gcggcggcgg gggtcgggtg aggcgcaaaa ggataaaaag cccgtggaag 
121 cggagctgag cagafcccgag ccgggctggc tgcagagaca ccgcagggag agcctcactg 

181 ctgagcgccc ctcgacggcg gacgggcagc agcctccgtg gcctccagca tccgacaaga 
5 241 agcttcagcc atgcaggccc cacgggagct cgcggtgggc atcgacctgg gcaccaccta 

3 01 ctcgtgcgtg ggcgtgtttc agcagggccg cgtggagatc ctggccaacg accagggcaa 
3 61 ccgcaccacg cccagctacg tggccttcac cgacaccgag cggctggtcg gggacgcggc 

421 caagagccag gcggccctga acccccacaa caccgtgttc gatgccaagc ggctgatcgg 

481 gcgcaagttc gcggacacca cggtgcagtc ggacatgaag cactggccct tccgggtggt 

10 541 gagcgagggc ggcaagccca aggtgccggt atcgtaccgc ggggaggaca agacgttcta 

601 ccccgaggag atctcgtcca tggtgctgag caagatgaag gagacggccg aggcgtacct 

661 gggccagccc gtgaagcacg cagtgatcac cgtgcccgcc tatttcaatg actcgoagcg 

721 ccaggccacc aaggacgcgg gggccatcgc ggggctcaac gtgttgcgga tcatcaatga 

781 gcccacggca gctgccatcg cctatgggct ggaccggcgg ggcgcgggag agcgcaacgt 

15 841 gctcattttt gacctgggtg ggggcacctt cgatgtgtcg gttctctcca ttgacgctgg 

901 tgtctttgag gtgaaagcca ctgctggaga tacccacctg ggaggagagg acttcgacaa 

961 ccggctcgtg aaccacttca tggaagaatt ccggcggaag catgggaagg acctgagcgg 

1021 gaacaagcgt gccctcggca ggctgcgcac agcctgtgag cgcgccaagc gcaccctgtc 

1081 ctccagcacc caggccaccc tggagataga ctccctgttc gagggcgtgg acttctacac 

20 1141 gtccatcact cgtgcccgct ttgaggaact gtgctcagac ctcttccgca gcaccctgga 

12 01 gccggtggag aaggccctgc gggatgccaa gctggacaag gcccagattc atgacgtcgt 
1261 cctggtgggg ggctccactc gcatccccaa ggtgcagaag ttgctgcagg acttcttcaa 
1321 cggcaaggag ctgaacaaga gcatcaaccc tgatgaggct gtggcctatg gggctgctgt 

13 81 gcaggcggcc gtgttgatgg gggacaaatg tgagaaagtg caggatctcc tgctgctgga 
25 1441 tgtggctccc ctgtctctgg ggctggagac agcaggtggg gtgatgacca cgctgatcca 

15 01 gaggaacgcc actatcccca ccaagcagac ccagactttc accacctact cggacaacca 

1561 gcctggggtc ttcatccagg tgtatgaggg tgagagggcc atgaccaagg acaacaacct 

1621 gctggggcgt tttgaactca gtggcatccc tcctgcccca cgtggagtcc cccagataga 

1681 ggtgaccttt gacattgatg ctaatggcat cctgagcgtg acagccactg acaggagcac 

30 1741 aggtaaggct aacaagatca ccafccaccaa tgacaagggc cggctgagca aggaggaggt 

1801 ggagaggatg gttcatgaag ccgagcagta caaggctgag gatgaggccc agagggacag 

1861 agtggctgcc aaaaactcgc tggaggccca tgtcttccat gtgaaaggtt ctttgcaaga 

1921 ggaaagcctt agggacaaga ttcccgaaga ggacaggcgc aaaatgcaag acaagtgtcg 

1981 ggaagtcctt gcctggctgg agcacaacca gctggcagag aaggaggagt atgagcatca 

35 2041 gaagagggag ctggagcaaa tctgtcgccc catcttctcc aggctctatg gggggcctgg 

2101 tgtccctggg ggcagcagtt gtggcactca agcccgccag ggggacccca gcaccggccc 

2161 catcattgag gaggttgatt gaatggccct tcgtgataag tcagctgtga ctgtcagggc 

2221 tatgcbatgg gccttctaga ctgtcttcta tgatcctgcc cttcagagat gaactttccc 

2281 tccaaagcta gaactttctt cccaggataa ctgaagtctt ttgacttttt gcggggaggg 

40 2341 cggttcatcc tcttctgctt caaataaaaa gtcattaatt tattaaaact tgtgtggcac 

2401 tttaacattg ctttcaccta tattttgtgt actttgttac ttgcatgtat gaattttgtt 
2461 atgtaaaata tagttataga cctaaataag ct 

45 SEQ ID NO. 18 

P14174 

macrophage migration inhibitory factor - Homo Sapiens 

50 1 mpmfivntnv prasvpdgfl seltqqlaga tgkppqylav hwpdqlmaf ggssepcalc 
61 slhsigkigg aqnrsyskll cgllaerlri spdrvyinyy dmnaanvgwn nstfa 
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NM_^002415 - Homo Sapiens 

Homo sapiens macrophage migration intiibitory factor 
(glycosylation-inhibiting factor) (MIF) , mRNA 

5 1 accacagtgg tgtccgagaa gtcaggcacg tagctcagcg gcggccgcgg cgcgtgcgtc 

61 tgtgcctctg cgcgggtctc ctggtccttc tgccatcatg ccgatgttca tcgtaaacac 
121 caacgtgccc cgcgcctccg tgccggacgg gttcctctcc gagctcaccc agcagctggc 
181 gcaggccacc ggcaagcccc cccagtacat cgcggtgcac gtggtcccgg accagctcat 
241 ggccttcggc ggctccagcg agccgtgcgc gctctgcagc ctgcacagca tcggcaagat 
10 3 01 cggcggcgcg cagaaccgct cctacagcaa gctgctgtgc ggcctgctgg ccgagcgcct 
361 gcgcatcagc ccggacaggg tctacatcaa ctattacgac atgaacgcgg ccaatgtggg 
421 ctggaacaac tccaccttcg cctaagagcc gcagggaccc acgctgtctg cgctggctcc 
481 acccgggaac ccgccgcacg ctgtgttcta ggcccgccca ccccaacctt ctggtgggga 
541 gaaataaacg gtttagagac t 



15 



SEQ XD NO. 20 



L19686 

20 Homo sapiens macropliage migration inhibitory factor (MIF) gene, 
complete cds 

1 ctgcaggaac caatacccat aggctatttg tataaatggg ccatggggcc tcccagctgg 
61 aggctggctg gtgccacgag ggtcccacag gcatgggtgt ccttcctata tcacatggcc 
25 121 ttcactgaga ctggtatatg gattgcacct atcagagacc aaggacagga cctccctgga 
181 aatctctgag gacctggcct gtgatccagt tgctgccttg tcctcttcct gctatgtcat 
241 ggcttatctt ctttcaccca ttcattcatt cattcattca ttcagcagta ttagtcaatg 
301 tctcttgata tgcctggcac ctgctagatg gtccccgagt ttaccattag tggaaaagac 
361 atttaagaaa ttcaccaagg gctctatgag aggccataca cggtggacct gactagggtg 
30 421 tggcttccct gaggagctga agttgcccag aggcccagag aaggggagct gagcacgttt 
481 gaaccactga acctgctctg gacctcgcct ccttccttcg gtgcctccca gcatcctatc 
541 ctctttaaag agcaggggtt cagggaagtt ccctggatgg tgattcgcag gggcagctcc 
601 cctctcacct gccgcatgac taccccgccc catctcaaac acacaagctc acgcatgcgg 
661 gactggagcc cttgaggaca tgtggcccaa agacaggagg tacaggggct cagtgcgtgc 
35 721 agtggaatga actgggcttc atctctggaa gggtaagggg ccatcttccg ggttcaccgc 
781 cgcatcccca cccccggcac agcgcctcct ggcgactaac atcggtgact tagtgaaagg 
841 actaagaaag acccgaggcg aggccggaac aggccgattt ctagccgcca agtggagaac 
901 aggttggagc ggtgcgccgg gcttagcggc ggttgctgga ggaacgggcg gagtcgccca 
961 gggtcctgcc ctgcgggggt cgagccgagg caggcggtga cttccccact cggggcggag 
40 1021 ccgcagcctc gcgggggcgg ggcctggcgc cggcggtggc gtcacaaaag gcgggaccac 
1081 agtggtgtcc gagaagtcag gcacgtagct cagcggcggc cgcggcgcgt gcgtctgtgc 
1141 ctctgcgcgg gtctcctggt ccttctgcca tcatgccgat gttcatcgta aacaccaacg 
1201 tgccccgcgc ctccgtgccg gacgggttcc tctccgagct cacccagcag ctggcgcagg 
1261 ccaccggcaa gcccccccag gtttgccggg aggggacagg aagagggggg tgcccaccgg 
45 1321 acgaggggtt ccgcgctggg agctggggag gcgactcctg aacggagctg gggggcgggg 
1381 cggggggagg acggtggctc gggcccgaag tggacgttcg gggcccgacg aggtcgctgg 
1441 ggcgggctga ccgcgccctt tcctcgcagt acatcgcggt gcacgtggtc ccggaccagc 
15 01 tcatggcctt cggcggctcc agcgagccgt gcgcgctctg cagcctgcac agcatcggca 
1561 agatcggcgg cgcgcagaac cgctcctaca gcaagctgct gtgcggcctg ctggccgagc 
50 1621 gcctgcgcat cagcccggac aggtacgcgg agtcgcggag gggcggggga ggggcggcgg 
1681 cgcgcggcca ggcccgggac tgagccaccc gctgagtccg gcctcctccc cccgcagggt 
1741 ctacatcaac tattacgaca tgaacgcggc caatgtgggc tggaacaact ccaccttcgc 
1801 ctaagagccg cagggaccca cgctgtctgc gctggctcca cccgggaacc cgccgcacgc 
1861 tgtgttctag gcccgcccac cccaaccttc tggtggggag aaataaacgg tttagagact 
55 1921 aggagtgcct cggggttcct tggcttgcgg gaggaattgg tgcagagccg ggacattggg 
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1981 gagcgaggtc gggaaacggt gttgggggcg ggggtcaggg ccgggttgct ctcctcgaac 

2 041 ctgctgttcg ggagcccttt tgtccagcct gtccctccta cgctcctaac agaggagccc 

2101 cagtgtctfct ccattctatg gcgtacgaag ggatgaggag aagttggcac tctgccctgg 
2161 gctgcag 



SEQ XD NO. 21 

P31949 

10 Calgizzarin - Homo sapiens 

1 makissptet erciesliav fqkyagkdgy nytlsktefl sfirmtelaaf tknqkdpgvl 
61 drmmkkldtn sdgqldfsef Inligglama chdsflkavp sqkrt 



15 



SEQ XD NO. 22 



NM_005 62 0 and D3 8583 - Homo sapiens 

Homo sapiens SlOO calcium binding protein All (calgizzarin) (SlOOAll) , 
20 mRNA 

1 gggcaaggct gggccgggaa gggcgtgggt fcgaggagagg ctccagaccc gcacgccgcg 
61 cgcacagagc tctcagcgcc gctcccagcc acagcctccc gcgcctcgct cagctccaac 
121 atggcaaaaa tctccagccc tacagagact gagcggtgca tcgagtccct gattgctgtc 

25 181 ttccagaagt atgctggaaa ggatggttat aactacactc tctccaagac agagttccta 
241 agcttcatga atacagaacfc agctgccttc acaaagaacc agaaggaccc tggtgtcctt 
3 01 gaccgcatga tgaagaaact ggacaccaac agtgatggtc agctagattt ctcagaattt 
3 61 cttaatctga ttggtggcct agctatggct tgccatgact ccttcctcaa ggctgtccct 
421 tcccagaagc ggacctgagg accccttggc cctggccttc aaacccaccc cctttccttc 

30 481 cagcctttct gtcatcatct ccacagccca cccatcccct gagcacacta accacctcat 
541 gcaggcccca cctgccaata gtaataaagc aatgtcactt ttttaaaaca tgaaa 



35 



45 



50 



SEQ XD NO. 23 

P0093 8 and NP_000356 - Homo sapiens 
Triosephosphate isomerase 



1 mapsrkffvg gnwkmngrkq slgeligtln aakvpadtev vcapptayid farqkldpki 
40 61 avaaqncykv tngaftgeis pgmikdcgat wwlghserr hvfgesdeli gqkvahalae 
121 glgviacige kldereagit ekwfeqtkv iadnvkdwsk wlayepvwa igtgktatpq 
181 qaqevheklr gwlksnvsda vaqstriiyg gsvtgatcke lasqpdvdgf Ivggaslkpe 
241 fvdiinakq 



SEQ XD NO. 24 

Niyi_0 0 03 65 

Homo sapiens triosephosphate isomerase l (XPIl) , mRNA 



1 ccttcagcgc ctcggctcca gcgccatggc gccctccagg aagttcttcg ttgggggaaa 

61 ctggaagatg aacgggcgga agcagagtct gggggagctc atcggcactc tgaacgcggc 

121 caaggtgccg gccgacaccg aggtggtttg tgctccccct actgcctata tcgacttcgc 

181 ccggcagaag ctagatccca agattgctgfc ggctgcgcag aactgctaca aagfcgactaa 

55 241 tggggctttt actggggaga tcagccctgg catgatcaaa gactgcggag ccacgtgggt 
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3 01 ggtcctgggg cactcagaga gaaggcatgt ctttggggag tcagatgagc tgattgggca 
3 61 gaaagtggcc catgctctgg cagagggact cggagtaatc gcctgcattg gggagaagct 
421 agatgaaagg gaagcfcggca tcactgagaa ggttgttttc gagcagacaa aggtcatcgc 
481 agataacgtg aaggactgga gcaaggtcgt cc tggcctat gagcctgfcgt gggccattgg 
5 541 tactggcaag actgcaacac cccaacaggc ccaggaagta cacgagaagc tccgaggatg 
601 gctgaagtcc aacgtctctg atgcggtggc tcagagcacc cgtatcattt atggaggctc 
661 tgtgactggg gcaacctgca aggagctggc cagccagcct gatgtggatg gcttccttgt 
721 gggtggtgct tccctcaagc ccgaattcgt ggacatcatc aatgccaaac aatgagcccc 
781 atccatcttc cctacccttc ctgccaagcc agggactaag cagcccagaa gcccagtaac 
10 841 tgccctttcc ctgcatatgc ttctgatggt gtcatctgct ccttcctgtg gcctcatcca 
901 aactgtatct tcctfctactg tttatatctt caccctgtaa tggttgggac caggccaatc 
961 ccttctccac ttactataat ggttggaact aaacgtcacc aaggtggctt ctccttggct 
1021 gagagatgga aggcgtggtg ggatttgctc ctgggttccc taggccctag tgagggcaga 
10 81 agagaaacca tcctctccct tcttacaccg tgaggccaag atcccctcag aaggcaggag 
15 1141 tgctgccctc tcccatggtg cccgtgcctc tgtgctgtgt atgtgaacca cccatgtgag 
1201 ggaataaacc tggcactagg aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa 



SEQ It) NO. 25 

20 

X69723 

H. sapiens TPll gene for trios ephosphate isomerase- 

1 ctgcagttcc tgccaggcct tgccagccgg ggcgagggtt gggatgatcc tggcggccta 

25 61 tgcctgtgtg ggctgcccct cccgctgtga accctgcatt tgtcccgcaa gttttcactc 

121 aggtagactc cctgggtaca agggtgcctg ctcagcagtc gggcatgagc tgctccgatg 

181 ggcgaaggag gttgtctatt ccacagttgg agaggggccc tctctgcccc agtgggcgat 

241 ctgggctacg gccaagttgc caccagctag ttccgcttga aaaccacttc tggccccgtg 

3 01 ggggactcaa gtcgccaagc gagggttccc ctgagcgccg gagctcacag gtctcgcctt 

30 361 gtcccgaaag ccccgcaafcc gaggcggagg cgaccgagcc cccgactctc ctagaacgtt 

421 gccacaagaa gggggaacgt cggaacagtg catcatcggg cggcggccgg ggcggcggca 

481 ggagggcggg cggggggcag ggctccgggg gactgggcgg gccatggcgg aggacggcga 

541 ggaggcggag ttccacttcg cggcgctcta tataagtggg cagtggccgc gactgcgcgc 

601 agacactgac cttcagcgcc tcggctccag cgccatggcg ccctccagga agttcttcgt 

35 661 tgggggaaac tggaagatga acgggcggaa gcagagtctg ggggagctca tcggcactct 

721 gaacgcggcc aaggtgccgg ccgacaccgg taagccctcg ccgaggaggg gtctggccgg 

781 gccggggccg ggccggggca ggagtggcag cgcctctccc gaggcccgag gtccgggccg 

841 gtatccgcgc ggacctgatg cagggctgtg ggacgagggc cgctggggtc cgggcagggg 

901 cctcgcagcc gcagccccgt cggtgcgtcg agggggcagg gcggagcaca tgatgcccct 

40 961 tggactacgg ggcaggtaag gacgttttgg gtctcctgga ggaaggcggc cccggggcgc 

1021 gcactggctg tgcccgccag gcgacggggt taggagccga gcccgaggct ctgcgggaga 

1081 ccgggggagg ctgggccgcg tgggcttccg ctccctgccc tggcctccgc gtgcgcgccg 

1141 ccgcacgtag ccccagactc ctccccctcc tcgccggcgt cgtcccgcgc cgagctgctg 

12 01 ctgccctgag cccccagatc tgaacccctt cccttcggca acctgagcga ctcccgcctt 
45 1261 ccacggaagg gaccgagccc gtgccaaaca ggctgagcga tttgggagtg aggagccatc 

13 21 Gtaccgcttt ccccaacctg gaaacagcaa agcgcaaggc ctctgagtca gttaggtctc 
13 81 tgccacccac gggcaaagga tgctctcctc catcctcctt cctccctcca ccgaaatcgg 
1441 agagccgcgg gcctgatcca aagaggcatc cccttctcgt tcattcccca gaggcctcaa 
15 01 tacaaacccc aggagttggc ccctctcctt ttgctacaaa tccttgcctt gcaaagggga 

50 15 61 ggtgaggatg ggctatttta gaagggaagc agggttgctc cctggagaat gctgagtctg 
1621 tgaggtgcct atgccgagaa tagctcgagg aaattggagc cccagctgtt aaaagagcag 
1681 agggcagggt gagggccgtg gctctcaggg gtatctggaa ggctcttcga gttgagtgca 
1741 gacccagcct tgggctggaa aatggacaaa ggtcatcttg ctggggtgaa aagggggaga 
18 01 gcagaaccaa gaagaagagg gtgagggctg gggggctcca gggcactggt taggaattgt 

55 1861 ggggaatgaa ggctttcttt agtctcatcc ccctgtggta ccatcttgtc ctcagaggtg 
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1921 gtttgtgctc cccctactgc ctatatcgac ttcgcccggc agaagctaga tcccaagatt 
1981 gctgtggctg cgcagaactg ctacaaagtg actaatgggg cttttactgg ggagatcagg 
204i tgagatcgag gtggagaggg gtgtgtggga cccttccctc actttcctcg ttgaggggaa 
2101 agccacaggg tgggctccct gctgaacctt ggcttcatct cttcctttag ccctggcatg 
5 2161 atcaaagact gcggagccac gtgggtggtc ctggggcact cagagagaag gcatgtcttt 
2221 ggggagtcag atgaggttag tagccaagag agaagataag ggatgtcttt ttccaagaag 
2281 gatgtctcac caagtctgtt tctcaacagc tgattgggca gaaagtggcc catgctctgg 
2341 cagagggact cggagtaatc gcctgcattg gggagaagct agatgaaagg gaagctggca 
2401 tcactgagaa ggttgttttc gagcagacaa aggtcatcgc aggtatctct ggagaaaggg 

10 2461 acctttgagc ctatccaggg ccacagagac tcagagggta gggtcaggcc ctggagcctg 
2521 tcttggtccc catgctgatc cagaaaagga aaaaggggag ggggagtgac aatctttgct 
2581 tggggcctat gacttctcca gccccaaggt agatgccacc tggaaatccc ccaatgtcca 
2641 ctagggggca gtaggccacc gttcttcgta ctccggagaa cctggctgga gagctctttc 
2701 ttgttcaccc ttccctccat ctgtatctct gccctgcaga taacgtgaag gactggagca 

15 2761 aggtcgtcct ggcctatgag cctgtgtggg ccattggtac tggcaagact gcaacacccc 
2821 aacaggtaac cgggcccagg agccctgccc tcatcccagc ctgcctcaat aggtttggac 
2881 agacacagcc cacatggagc aaccccttat ttcaaagaca cagagacctt gaacccagag 
2941 acagtgactt gtccaagggc atccagtcca gggcctggct tggatcagag ccctggtact 
3 001 ctgactcagt cagaaaccac actaagtgtc cactggtgcc agtgattttt cctcttagag 

20 3 061 aggcagaaaa ggtcttactt aggccagctt cttgttctag gcccaggaag tacacgagaa 
3121 gctccgagga tggctgaagt ccaacgtctc tgatgcggtg gctcagagca cccgtatcat 
3181 ttatggaggt gagtggcttt ggttcccggc tgaggtggag tgggctgagg actagactga 
3241 gccctcggac atggaggtgg ggatggggca gactcatccc attcttgacc aagcccttgfc 
3 3 01 tctgctccct tcccaggctc tgtgactggg gcaacctgca aggagctggc cagccagcct 

25 3361 gatgtggatg gcttccttgt gggtggtgct tccctcaagc ccgaattcgt ggacatcatc 
3421 aatgccaaac aatgagcccc atccatcttc cctacccttc ctgccaagcc agggactaag 
3481 cagcccagaa gcccagtaac tgccctttcc ctgcatatgc ttctgatggt gtcatctgct 
3541 ccttcctgtg gcctcatcca aactgtatct tcctttactg tttatatctt caccctgtaa 
3601 tggttgggac caggccaatc ccttctccac ttactataat ggttggaact aaacgtcacc 

30 3 661 aaggtggctt ctccttggct gagagatgga aggcgtggtg ggatttgctc cfcgggttccc 

3 721 taggcccfcag tgagggcaga agagaaacca tcctctccct tcttacaccg tgaggccaag 
3781 atccGctcag aaggcaggag tgctgccctc tcccatggtg cccgtgcctc tgtgctgtgt 
3841 atgtgaacca cccatgtgag , ggaataaacc tggcactagg tcttgtggtt tgtctgcctt 
3901 cactggactt gcccagataa tctfccctttt tgaggcagct atataaatga tcatttgtgc 

35 3 961 aagaaaaaaa aaaaaacaag aacaggtttc tataacaaca tctcttacta tttttacttg 
4021 aaaaaatgtt ttgcgtagca gactgtcata gccttgaacg ccggctccct ttcttcctcc 

4 081 ctccaagtgg ctctggggct gttgatttcc gcagagcttg ggttggggta gggctcagcc 
4141 tcaccagctt tcagcagctg gtctaggcca gcagtgcctc cccacctccc caagggaggg 
4201 tggtggcaag acctcagcac agtctgtggt atcacaggct cactggtaga gcagtagcgc 

40 4261 ttcatgcagg gggcaagggc agggcagaca cctggccgag cggtatcccc aggttgtggc 
4321 gcacacacag gcggctcagg tgcagaaggg agtgtggctc cgctgggaga gagaaggagg 
4381 ggaatgtaag tatgggtgca gccaccagcc agatgtcctc aaactacggg gtcctcatca 
4441 gatgcctttc tgctttcctg cttcgagtgt gcccacctgg ctgaaagggg aatttgagat 
4501 acccggaagt tctgcctccc agataagatt tcacacatcc ctagtcagag ctgggggtga 

45 4561 agagctggct aaggccctct aaacaacagg ccaaggtggc tctgacagtg gtggagctgg 
4621 Gccaggcttt gactccagag gcttgggagc tggggctgag gtgaggaggg atggccctcc 
4681 actctacagc ccaacacaac tgcagagagc agctccaagc cctggaccca gtcagttcct 
4741 ggggaggctc ctcccctgct gccccaccct aaggcctgcc tcctccactg ctctcctcct 
4801 ccctggtgcc cagggcccca gtgtctccat cctgaggtgt ggctgaggaa ggaagtaggt 

50 4 861 atgtggcaca gagacaggtt agagcccagg gaatccggta tacagcctgg gtacctcgtc 
4921 tgcccatcct tcttttggac ctgtacatca aacccagtac ctaaccgttt gcacctcttg 
4981 cctaggggtg attactcctg aattc 

55 SEQ ID NO. 26 
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Q05586 - Homo sapiens 

Glutamate [NMDA] receptor subuixit zeta 1 precursor 

5 MSTMRLLTLA LLFSCSVARA ACDPKIVNIG AVLSTRKHEQ MFREAVNQAN KRHGSWKIQli 
NATSVTHKPN AIQMALSVCE DLISSQVYAI LVSHPPTPND HFTPTPVSYT AGFYRIPVLG 
LTTRMSIYSD KSIHLSFLRT VPPYSHQSSV WFEMMRVYSW NHIILLVSDD HEGRAAQKRL 
ETLLEBRESK AEKVIiQFDPG TKNVTALLME AKELEARVII LSASEDDAAT VYRAAAMLMM 
TGSGYVWLVG EREISGNADR YAPDGILGLQ LINGKNESAH ISDAVGWAQ AVHELliEKEN 

10 ITDPPRGCVG NTNIWKTGPL FKRVLMSSKY ADGVTGRVEF NEDGDRKFAN YSXMNLQNRK 
liVQVGXYNGT HVIPNDRKII WPGGETEKPR GYQMSTRIiKX VTIHQEPFVY VKPTLSDGTC 
KEEFTVNGDP VKKVICTGPN DTSPGSPRHT VPQCCYGFCI DLLIKLARTM NPTYEVHLVA 
DGKFGTQERV NNSNKKEWNG MMGELLSGQA DMIVAPLTIN NERAQYIEFS KPFKYQGLTI 
LVKKEIPRST LDSFMQPFQS TIiWLjLVGXjSV HWAVMLYLL DRFSPFGRFK VNSEEEEBDA 

15 LTLSSAMWPS WGVLLNSGIG EGAPRSFSAR ILGMVWAGFA MIIVASYTAN loAAFLVLDRP 
EERITGINDP RLRNPSDKFI YATVKQSSVD IYFRRQVELS TMYRHMEKHN YESAAEAIQA 
VRDNKBHAFI WDSAVLEFEA SQKCDLVTTG ELFFRSGFGI GMRKDSPWKQ NVSLSILKSH 
BNGFMBDIjDK TWVRYQECDS RSNAPATLTF ENMAGVFMIiV AGGIVAGIFL IFIEIAYKRH 
KDARRKQMQL AFAAWVWRK NLQDRKSGRA EPDPKKKATF RAITSTLASS FKRRRSSKDT 

20 STGGGRGAXiQ NQKDTVliPRR AIEREEGQLQ LCSRHRES 



SEQ ID NO- 27 

25 D13515 

Homo sapiens mRJSTA for key subunit of N-methyl-D -aspartate receptor, 
complete cds 

1 gcttcagcgc cccttccctc ggccgacgtc ccgggaccgc cgctccgggg gagacgtggc 
30 61 gtccgcagcc cgcggggccg ggcgagcgca ggacggcccg gaagccccgc gggggatgcg 

121 ccgagggccc cgcgttcgcg ccgcgcagag ccaggcccgc ggcccgagcc catgagcacc 
181 atgcgcctgc tgacgctcgc cctgctgttc tcctgctccg tcgcccgtgc cgcgtgcgac 
241 cccaagatcg tcaacattgg cgcggtgctg agcacgcgga agcacgagca gatgttccgc 
3 01 gaggccgtga accaggccaa caagcggcac ggctcctgga agattcagct caatgccacc 
35 361 tccgtcacgc acaagcccaa cgccatccag atggctctgt cggtgtgcga ggacctcatc 
421 tccagccagg tctacgccat cctagttagc catccaccta cccccaacga ccacttcact 
481 cccacccctg tctcctacac agccggcttc taccgcatac ccgtgctggg gctgaccacc 
541 cgcatgtcca tctactcgga caagagcatc cacctgagct tcctgcgcac cgtgccgccc 
6 01 tactcccacc agtccagcgt gtggtttgag atgatgcgtg tctacagctg gaaccacatc 
40 661 atcctgctgg tcagcgacga ccacgagggc cgggcggctc agaaacgcct ggagacgctg 
721 ctggaggagc gtgagtccaa ggcagagaag gtgctgcagt ttgacccagg gaccaagaac 
781 gtgacggccc tgctgatgga ggcgaaagag ctggaggccc gggtcatcat cctttctgcc 
841 agcgaggacg atgctgccac tgtataccgc gcagccgcga tgctgaacat gacgggctcc 
901 gggtacgtgt ggctggtcgg cgagcgcgag atctcgggga acgccctgcg ctacgcccca 
45 * 961 gacggcatcc tcgggctgca gctcatcaac ggcaagaacg agtcggccca catcagcgac 
1021 gccgtgggcg tggtggccca ggccgtgcac gagctcctcg agaaggagaa catcaccgac 
1081 ccgccgcggg gctgcgtggg caacaccaac atctggaaga ccgggccgct cttcaagaga 
1141 gtgctgatgt cttccaagta tgcggatggg gtgactggtc gcgtggagtt caatgaggat 
1201 ggggaccgga agttcgccaa ctacagcatc atgaacctgc agaaccgcaa gctggtgcaa 
50 1261 gtgggcatct acaatggcac ccacgtcatc cctaatgaca ggaagatcat ctggccaggc 
1321 ggagagacag agaagcctcg agggtaccag atgtccacca gactgaagat tgtgacgatc 
1381 caccaggagc ccttcgtgta cgtcaagccc acgctgagtg atgggacatg caaggaggag 
1441 ttcacagtca acggcgaccc agtcaagaag gtgatctgca ccgggcccaa cgacacgtcg 
1501 ccgggcagcc cccgccacac ggtgcctcag tgttgctacg gcttttgcat cgacctgctc 
55 1561 atcaagctgg cacggaccat gaacttcacc tacgaggtgc acctggtggc agatggcaag 
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1621 ttcggcacac aggagcgggt gaacaacagc aacaagaagg agtggaatgg gatgatgggc 
1681 gagctgctca gcgggcaggc agacatgatc gtggcgccgc taaccataaa caacgagcgc 
1741 gcgcagfcaca tcgagttttc caagcGcttc aagtaccagg gcctgactat tctggtcaag 
1801 aaggagattc cccggagcac gctggactcg ttcatgcagc cgttccagag cacactgtgg 
5 1861 ctgctggtgg ggctgtcggt gcacgtggtg gccgtgatgc tgtacctgct ggaccgcttc 
1921 agccccttcg gccggttcaa ggtgaacagc gaggaggagg aggaggacgc actgaccctg 
1981 tcctcggcca tgtggttctc ctggggcgtc ctgctcaact ccggcatcgg ggaaggcgcc 

2 041 cccagaagct tctcagcgcg catcctgggc atggtgtggg ccggctttgc catgatcatc 
2101 gtggcctcct acaccgccaa cctggcggcc ttcctggtgc tggaccggcc ggaggagcgc 

10 2161 atcacgggca tcaacgaccc tcggctgagg aacccctcgg acaagtttat ctacgccacg 
2221 gtgaagcaga gctccgtgga tatctacttc cggcgccagg tggagctgag caccatgtac 
2281 cggcatatgg agaagcacaa ctacgagagt gcggcggagg ccatccaggc cgtgagagac 
2341 aacaagctgc atgccttcat ctgggactcg gcggtgctgg agttcgaggc ctcgcagaag 
24 01 tgcgacctgg tgacgactgg agagctgttt ttccgctcgg gcttcggcat aggcatgcgc 

15 24 61 aaagacagcc cctggaagca gaacgtctcc ctgtccatcc tcaagtccca cgagaatggc 
2521 ttcafcggaag acctggacaa gacgtgggtt cggtatcagg aatgtgactc gcgcagcaac 
2581 gcGCctgcga cccttacttt tgagaacatg gccggggtct tcatgctggt agctgggggc 
2641 atcgtggccg ggatcttcct gattttcatc gagattgcct acaagcggca caaggatgct 
2701 cgccggaagc agatgcagcfc ggcctttgcc gccgttaacg tgtggcggaa gaacctgcag 

20 27 61 gatagaaaga gtggtagagc agagcctgac cctaaaaaga aagccacatt tagggctatc 
2821 acctccaccc tggcttccag cttcaagagg cgtaggtcct ccaaagacac gagcaccggg 
2881 ggtggacgcg gcgctttgca aaaccaaaaa gacacagtgc tgccgcgacg cgctattgag 
2941 agggaggagg gccagctgca gctgtgttcc cgtcataggg agagctgaga ctccccgccc 

3 001 gccctcctct gccccctccc ccgcagacag acagacagac ggacgggaca gcggcccggc 
25 3 061 ccacgcagag ccccggagca ccacggggtc gggggaggag cacccccag 



SEQ ID NO. 28 

30 LLTLIiALIiFSCSVAR 



SKQ XD NO. 29 

35 ITMLCTGSRTLK 



SBQ XD NO. 3 0 

40 ITHU and P01009 - Homo sapiens 

a- 1 -antitrypsin precursor 

Ttipssvswgil llaglcclvp vslaedpqgd aaqktdtshh dqdhptfnki tpnlaefafs 
lyrqlahqBTi stniffspvs iatafamlsl gtkadtlxdei leglnfnlte ipeaqihegf 
45 qellrtlnqp dsqlqlttgn glflseglkl vdkfledvkk lyhseaftvn fgdteeakkq 
indyvekgtq gkivdlvkel drdtvfalvn yiffkgkwer pfevkdteee dfhvdqvttv 

kvpmmkrlgm fniqhckkls swvllmkylg nataifflpd egklqhlene Ithdiitkfl 
enedrrsasl hlpklsitgt ydlksvlgql gitkvfsnga dlsgvteeap Iklskavhka 
vltidekgte aagamfleai pnisippevkf nkpfvflmie qntksplfmg kwnptqk 
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SBQ XD NO. 31 

NM 000295 
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Homo sapiens serine (or cysteine) proteinase inhibitor, clade A (alpha- 1 
antiproteinase, antitrypsin), member 1 (SSRPINAl) , transcript variant 1, 
mRWA 

5 1 aatgactcct ttcggtaagt gcagtggaag ctgtacactg cccaggcaaa gcgtccgggc 

61 agcgtaggcg ggcgactcag atcccagcca gtggacttag cccctgtttg ctcctccgat 
121 aactggggtg accttggtta atattcacca gcagcctccc ccgttgcccc tctggatcca 
181 ctgcttaaat acggacgagg acagggccct gtctcctcag cttcaggcac caccactgac 
241 ctgggacagt gaatcgacaa tgccgtcttc tgtctcgtgg ggcatcctcc tgctggcagg 
10 3 01 cctgtgctgc ctggtccctg tctccctggc tgaggatccc cagggagatg ctgcccagaa 

3 61 gacagataca tcccaccatg atcaggatca cccaaccttc aacaagatca cccccaacct 
421 ggctgagttc gccttcagcc tataccgcca gctggcacac cagtccaaca gcaccaatat 
481 Gttcttctcc ccagtgagca tcgctacagc ctttgcaatg ctctccctgg ggaccaaggc 
541 tgacactcac gatgaaatcc tggagggcct gaatttcaac ctcacggaga ttccggaggc 
15 601 tcagatccat gaaggcttcc aggaactcct ccgtaccctc aaccagccag acagccagct 

661 ccagctgacc accggcaatg gcctgttcct cagcgagggc ctgaagctag tggataagtt 
721 tttggaggat gttaaaaagt tgtaccactc agaagccttc actgtcaact tcggggacac 
781 cgaagaggcc aagaaacaga tcaacgatta cgtggagaag ggtactcaag ggaaaattgt 
841 ggatttggtc aaggagcttg acagagacac agtttttgct ctggtgaatt acatcttctt 
20 901 taaaggcaaa tgggagagac cctttgaagt caaggacacc gaggaagagg acttccacgt 

961 ggaccaggtg accaccgtga aggtgcctat gatgaagcgt ttaggcatgt ttaacatcca 
1021 gcactgtaag aagctgtcca gctgggtgct gctgatgaaa tacctgggca atgccaccgc 
1081 catcttcttc ctgcctgatg aggggaaact acagcacctg gaaaatgaac tcacccacga 
1141 tatcatcacc aagttcctgg aaaatgaaga cagaaggtct gccagcttac atttacccaa 
25 12 01 actgtccatt actggaacct atgatctgaa gagcgtcctg ggtcaactgg gcatcactaa 

12 61 ggtcttcagc aatggggctg acctctccgg ggtcacagag gaggcacccc tgaagctctc 
1321 caaggccgtg cataaggctg tgctgaccat cgacgagaaa gggactg-aag ctgctggggc 

13 81 catgttttta gaggccatac ccatgtctat cccccccgag gtcaagttca acaaaccctt 
1441 tgtcttctta atgattgaac aaaataccaa gtctcccctc ttcatgggaa aagtggtgaa 

30 1501 tcccacccaa aaataactgc ctctcgctcc tcaacccctc ccctccatcc ctggccccct 
15 61 ccctggatga cattaaagaa gggttgagct ggtccctgcc tgcaaaa 



35 



SEQ XT) NO. 32 

K02212 

Human alpha- 1-antitrypsin gene {S variant) , complete cds 



1 gaattccagg ttggaggggc ggcaacctcc tgccagcctt caggccactc tcctgtgcct 

40 61 gccagaagag acagagcttg aggagagctt gaggagagca ggaaaggtgg aacattgctg 

121 ctgctgctca ctcagttcca caggtgggag gaacagcagg gcttagagtg ggggtcattg 

181 tgcagatggg aaaacaaagg cccagagagg ggaagaaatg cctaggagct accgagggca 

241 ggcgacctca accacagccc agtgctggag ctgtgagtgg atgtagagca gcggaatatc 

3 01 cattcagcca gctcagggga aggacagggg ccctgaagcc aggggatgga gctgcaggga 
45 3 61 agggagctca gagagaaggg gaggggagtc tgagctcagt ttcccgctgc ctgaaaggag 

421 ggtggtacct actcccttca aagggtaact gaatgagaga ctgcctggag gaaagctctt 

4 81 caagtgtggc ccaccccacc ccagtgacac cagcccctga cacgggggag ggagggcagc 
541 atcaggaggg gctttctggg cacacccagt acccgtctct gagctttcct tgaactgttg 
601 cattttaatc ctcacagcag ctcaacaagg tacataccgt caccatcccc attttacaga 

50 661 tagggaaatt gaggctcgga gcggt taaac aactcacctg aggcctcaca gccagtaagt 

721 gggttccGtg gtctgaatgt gtgtgctgga ggatcctgtg ggtcactcgc ctggtagagc 

781 cccaaggtgg aggcataaat gggactggtg aatgacagaa ggggcaaaaa tgcactcatc 

841 cattcactct gcaagtatct acggcacgta cgccagctcc caagcaggtt tgcgggttgc 

901 acagcggagc gatgcaatct gatttaggct tttaaaggat tgcaatcaag tgggacccac 

55 961 tagcctcaac cctgtacctc ccctcccctc cacccccagc agtctccaaa ggcctccaac 
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1021 aaccccagag tgggggccat gtatccaaag aaactccaag ctgtatacgg atcacactgg 
1081 ttttccagga gcaaaaacag aaacagcctg aggctggtca aaattgaacc tcctcctgct 
1141 ctgagcagcc tagggggcag actaagcaga gggctgtgca gacccacata aagagcctac 

12 01 tgtgtgccag gcacttcacc cgaggcactt cacaagcafcg cttgggaatg aaacttccaa 
5 12 SI ctctttggga tgcaggtgaa acagttcctg gttcagagag gtgaagcggc ctgcctgagg 

1321 cagcacagct cttctttaca gatgtgcttc cccacctcta ccctgtctca cggcccccca 

13 81 tgccagcctg acggttgtgt ctgcctcagt catgctccat ttttccatcg ggaccatcaa 
1441 gagggtgttt gtgtctaagg ctgactgggt aactttggat gagcggtctc tccgctccga 
1501 gcctgtttcc tcatctgtca aacgggctct aacccactct gatctcccag ggcggcagta 

10 1561 agtcttcagc atcaggcatt ttggggtgac tcagtaaatg gtagatcttg ctaccagtgg 
1621 aacagccact aaggattctg cagfcgagagc agagggccag ctaagtggta ctctcccaga 
1681 gactgtctga ctcacgccac cccctccacc ttggacacag gacgctgtgg tttctgagcc 
1741 aggtacaatg actcctttcg gtaagtgcag tggaagctgt acactgccca ggcaaagcgt 
1801 ccgggcagcg taggcgggcg actcagatcc cagccagtgg acttagcccc tgtttgctcc 

15 1861 tccgataact ggggtgacct tggttaatat tcaccagcag cctcccccgt tgcccctctg 
1921 gatccactgc ttaaatacgg acgaggacag ggccctgtct cctcagcttc aggcaccacc 
1981 actgacctgg gacagtgaat cgtaagtatg cctttcactg cgaggggttc tggagaggct 
2 041 tccgagctcc ccatggccca ggcaggcagc aggtctgggg caggaggggg gttgtggagt 
2101 gggtatccgc ctgctgaggt gcagggcaga tggagaggct gcagctgagc tcctattttc 

20 2161 ataataacag cagccatgag ggttgtgtcc tgtttcccag tcctgcccgg tcccccctcg 
2221 gfcacctcctg gtggatacac tggttcctgt aagcagaagt ggatgagggt gtctaggtct 
2281 gcagtcctgg caccccagga tgggggacac cagccaagat acagcaacag caacaaagcg 
2341 cagccatttc tttctgtttg cacagctcct ctgtctgtcg ggggctcctg tctgttgtct 
2401 cctataagcc tcaccacctc tcctactgct tgggcatgca tctttctccc cttctataga 

25 2461 tgaggaggtt aaggttcaga gaggggtggg gaggaacgcc ggctcacatt ctccatcccc 
2521 tccagatatg accaggaaca gacctgtgcc agcctcagcc ttacatcaaa atgggcctcc 
2581 ccatgcaccg tggacctctg ggccctcctg tcccagtgga ggacaggaag ctgtgagggg 
2641 cactgtcacc cagggctcaa gctggcattc ctgaataatc gctctgcacc aggccacggc 
2701 taagctcagt gcgtgattaa gcctcataac cctccaaggc agttactagt gtgattccca 

30 2761 ttttacagat gaggaagatg gggacagaga ggtgaataac tggccccaaa tcacacacca 
2821 tccataattc gggctcaggc acctggctcc agtccccaaa ctcttgaacc tggccctagt 

2 881 gtcactgttt ctcttgggtc tcaggcgctg gatggggaac aggaaacctg ggctgaactt 
2941 gaggcctctc tgatgctcgg tgacttcaga cagttgctca acctctctgt tctcttgggc 

3 001 aaaacatgat aacctttgac ttctgtcccc tcccctcacc ccacccgacc ttgatctctg 
35 3 061 aagtgttgga aggatttaat ttttcctgca ctgagttttg gagacaggtc aaaaagatga 

3121 ccaaggccaa ggtggccagt ttcctataga acgcctctaa aagacctgca gcaatagcag 
3181 caagaactgg tattctcgag aacttgctgc gcagcaggca cttcttggca ttttatgtgt 
3241 atttaatttc acaatagctc tatgacaaag tccacctttc tcatctccag gaaactgagg 
33 01 ttcagagagg ttaagtaact tgtccaaggt cacacagcta atagcaagtt gacgtggagc 

40 3361 aatctggcct cagagccttt aattttagcc acagactgat gctcccctct tcatttagcc 
3421 aggctgcctc tgaagttttc tgattcaaga cttctggctt cagctttgta cacagagatg 
3481 attcaatgtc aggttttgga gcgaaatctg tttaatccca gacaaaacat ttaggattac 
3541 atctcagttt tgtaagcaag tagctctgtg atttttagtg agttatttaa tgctctttgg 
3 601 ggctcaattt fctctatctat aaaatagggc taataatttg caccttatag ggtaagcttt 

45 3 661 gaggacagat tagatgatac ggtgcctgta aaacaccagg tgttagtaag tgtggcaatg 
3 721 atggtgacgc tgaggctgtg tttgcttagc atagggttag gcagctggca ggcagtaaac 

3 781 agttggataa tttaatggaa aatttgccaa actcagatgc tgttcactgc tgagcaggag 
3841 ccccttcctg ctgaaatggt cctggggagt gcagcaggct ctccgggaag aaatctacca 
3901 tctctcgggc aggagctcaa cctgtgtgca ggtacaggga gggcttcctc acctggtgcc 

50 3961 cactcatgca ttacgtcagt tattcctcat ccctgtccaa aggattcttt tctccattgt 

4 021 acagctatga agctagtgct caaagaagtg aagtcattta ccccaggccc cctgccagta 
4 081 agtgacaggg cctggtcaca cttgggttta tttattgccc agttcaacag gttgtttgac 
4141 cataggcgag attctcttcc ctgcaccctg ccgggttgct cttggtccct tattttatgc 
42 01 fccctgggtag aaatggtgcg agafctaggca gggagtggac gcttccctgt ccctggcccc 

55 4261 gcaaagagtg ctcccacctg ccccgatccc agaaatgtca ccatgaagcc ttcattcttt 
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4321 tggtttaaag cttggcctca gtgtccgtac accatggggt ccttggccag atggcgactt 
43 81 tctcctctcc agtcgccctc ccaggcacta gcttttagga gtgcagggtg ctgcctctga 
4441 tagaagggcc aggagagagc aggttttgga gacctgatgt tataaggaac agcttgggag 
45 01 gcataatgaa cccaacatga tgcttgagac caatgtcaca gcccaattct gacattcatc 
5 45 61 atctgagatc tgaggacaca gctgtctcag ttcatgatct gagtgctggg aaagccaaga 
4621 cttgttccag ctttgtcact gacttgctgt atagcctcaa caaggccctg accctctctg 
4681 ggcttcaaac tcttcactgt gaaaggagga aaccagagta ggtgatgtga caccaggaaa 
4741 gatggatggg tgtgggggaa tgtgctcctc ccagctgtca ccccctcgcc accctccctg 
48 01 caccagcctc tccacctcct ttgagcccag aattcccctg tctaggaggg cacctgtctc 

10 4 861 gtgcctagcc atgggaattc tccatctgtt ttgctacatt gaacccagat gccattctaa 
4921 ccaagaatcc tggctgggtg caggggctct cgcctgtaac cccagcactt tgggaggcca 
4981 aggcaggcgg atcaagaggt caggagttca agacctgcct ggccaacacg gtgaaacctc 
5041 agctctacta aaaatacaaa aattagccag gcgtggtggc acacgcctgt aatcccagct 
5101 atttgggaag ctgagacaga agaatttctt gaacccggga ggtggaggtt tcagtgagcc 

15 5161 gagatcacgc cactgcactc caccctggcg gataaagcga gactctgtct caaaaaaaac 
5221 ccaaaaacct atgttagtgt acagagggcc ccagtgaagt cttctcccag ccccactttg 
52 81 cacaactggg gagagtgagg ccccaggacc agaggattct tgctaaaggc caagfcggata 
5341 gtgatggccc tgccaggcta gaagccacaa cctctggccc tgaggccact cagcatattt 
5401 agtgtcccca ccctgcagag gcccaactcc ctcctgacca ctgagccctg taatgatggg 

20 5461 ggaatttcca taagccatga aggactgcac aaagttcagt tgggagtgaa agagaaatta 
5521 aagggagatg gaaatataca gcactaattt tagcaccgtc ttcagttcta acaacactag 
5581 ctagctgaag aaaatacaaa catgtattat gtaatgtgtg gtctgttcca tttggattac 
5 641 ttagaggcac gagggccaag gagaaaggtg gtggagagaa accagctttg cacttcattt 
5701 gttgctttat tggaaggaaa cttttaaaag bccaaggggg ttgaagaatc tcaatatttg 

25 5 761 ttatttccag ctttttttct ccagtttttc afcttcccaaa ttcaaggaca cctttttctt 
5 821 tgtattttgt taagatgatg gttttggttt tgtgactagt agttaacaat gtggctgccg 
5881 ggcatattct cctcagctag gacctcagtt ttcccatctg tgaagacggc aggttctacc 
5 941 tagggggctg caggcaggtg gtccgaagcc tgggcatatc tggagtagaa ggatcactgt 
6001 ggggcagggc aggttctgtg ttgctgtgga tgacgttgac tfctgaccatt gctcggcaga 

30 6061 gcctgctctc gctggttcag ccacaggccc caccactccc tattgtctca gccccgggta 
6121 tgaaacatgt atfccctcact ggcctatcac ctgaagcctt tgaatttgca acacctgcca 
6181 acccctccct caaaagagtt gccctctcta gatccttttg atgtaaggtt tggtgttgag 
6241 acttatttca ctaaattctc atacataaac atcactttat gtatgaggca aaatgaggac 
63 01 cagggagatg aatgacttgt cctggctcat acacctggaa agtgacagag tcagattaga 

35 6361 tcctaggtct atctgaagtt aaaagaggtg tcttttcact tcccacctcc tccatctact 
6421 ttaaagcagc acaaacccct gctttcaagg agagatgagc gtctctaaag cccctgacag 
6481 caagagccca gaactgggac accattagtg acccagacgg caggtaagct gactgcagga 
6541 gcatcagcct attcttgtgt ctgggaccac agagcattgt ggggacagcc ccgtctcttg 
6601 ggaaaaaaac cctaagggct gaggatcctt gtgagtgfctg ggtgggaaca gctcccagga 

40 6661 ggtttaatca cagcccctcc atgctctcta gctgttgcca ttgtgcaaga tgcatttccc 
6721 ttctgtgcag cagtttccct ggccactaaa tagtgggatt agatagaagc cctccaaggg 
6781 Gtccagcttg acatgattct tgattctgat ctgacccgat tctgataatc gtgggcaggc 
6841 ccattcctct tcttgtgcct cattttcttc ttttgtaaaa caatggctgt accatttgca 
6901 tcttagggtc attgcagatg aaagtgttgc tgtccagagc ctgggtgcag gacctagatg 

45 6961 taggattctg gttctgctac ttcctcagtg acattgaata gctgacctaa tctctctggc 
7021 tttggtttct tcatctgtaa aagaaggata ttagcattag cacctcacgg gattgttaca 
7081 agaaagcaat gaattaacac atgtgagcac ggagaacagt gcttggcata tggfcaagcac 
7141 tacgtacatt ttgctattct tctgattctt tcagtgttac tgatgtcggc aagtacttgg 

72 01 cacaggctgg tttaataatc cctaggcact ttcacgtggt gbcaafcccct gatcactggg 
50 7261 agtcatcatg tgccttgact cgggcctggc ccccccatct ctgtcttgca ggacaatgcc 

7321 gtcttctgtc tcgtggggca tcctcctgct ggcaggcctg tgctgcctgg tccctgtctc 

73 81 cctggctgag gatccccagg gagatgctgc ccagaagaca gatacatccc accatgatca 
7441 ggatcaccca accttcaaca agatcacccc caacctggct gagttcgcct tcagcctata 
75 01 ccgccagctg gcacaccagt ccaacagcac caatatcttc ttctccccag tgagcatcgc 

55 7561 tacagccttt gcaatgctct ccctggggac caaggctgac actcacgatg aaatcctgga 
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7621 gggcctgaat ttcaacctca cggagattcc ggaggctcag atccatgaag gcttccagga 
7681 actcctccgt acccfccaacc agccagacag ccagctccag ctgaccaccg gcaatggcct 
7741 gttcctcagc gagggcctga agctagtgga taagfcttttg gaggatgtta aaaagttgta 
78 01 ccactcagaa gccttcactg tcaacttcgg ggacaccgaa gaggccaaga aacagatcaa 
5 7861 cgattacgtg gagaagggta ctcaagggaa aattgtggat ttggtcaagg agcttgacag 
7921 agacacagtt tttgctctgg tgaattacat cttctttaaa ggtaaggttg ctcaaccagc 
7981 ctgagctgtt tcccatagaa acaagcaaaa atatttctca aaccatcagt tcttgaactc 
8041 tccttggcaa tgcattatgg gccatagcaa tgcttttcag cgtggattct tcagttttct 
8101 acacacaaac actaaaatgt tttccatcat tgagtaattt gaggaaataa tagattaaac 

10 8161 tgtcaaaact actgacgctc tgcagaactt ttcagagcct ttaatgtcct tgtgtatact 
8221 gtatatgtag aatatataat gcttagaact atagaacaaa ttgtaataca ctgcataaag 
8281 ggatagtttc atggaacata ctttacacga ctctagtgtc ccagaatcag tatcagtttt 
8341 gcaatctgaa agacctgggt tcaaatcctg cctctaacac aattagcttt tgacaaaaac 
8401 aatgcattct acctctttga ggtgctaatt tctcatctta gcatggacaa aataccattc 

15 84 61 ttgctgtcag gtttttttag gattaaacaa atgacaaaga ctgtggggat ggtgtgtggc 
8521 atacagcagg tgatggactc ttctgtatct caggctgcct tcctgcccct gaggggttaa 
85 81 aatgccaggg tcctgggggc cccagggcat tctaagccag ctcccactgt cccaggaaaa 
8641 cagcataggg gaggggaggt gggaggcaag gccaggggct gcttcctcca ctctgaggct 
8701 cccttgctct tgaggcaaag gagggcagtg gaggcaagcc aggctgcagt cagcacagct 

20 8761 aaagtcctgg ctctgctgtg gccttagtgg gggcccaggt ccctctccag ccccagtctc 
8821 ctccttctgt ccaatgagaa agctgggatc aggggtccct gaggcccctg tccactctgc 
88 81 atgcctcgat ggtgaagctc tgttggtatg gcagagggga ggctgctcag gcatctgcat 
8941 ttcccctgcc aatctagagg atgaggaaag ctctcaggaa tagtaagcag aatgtttgcc 
9001 ctggatgaat aactgagctg ccaattaaca aggggcaggg agccttagac agaaggtacc 

25 9061 aaatatgcct gatgctccaa cattttattt gtaatatcca agacaccctc aaataaacat 
9121 atgattccaa taaaaatgca cagccacgat ggcatcfcctt agcctgacat cgccacgatg 
9181 tagaaattct gcatcttcct ctagttttga attatcccca cacaatcttt ttcggcagct 
9241 tggatggtca gtttcagcac cttttacaga tgatgaagct gagcctcgag ggatgtgtgt 

93 01 cgtcaagggg gctcagggct tctcagggag gggactcatg gtttcttatt ctgctacact 
30 93 61 cttccaaacc ttcactcacc cctggtgatg cccaccttcc cctctctcca ggcaaatggg 

9421 agagaccctt tgaagtcaag gacaccgagg aagaggactt ccacgtggac caggtgacca 

94 81 ccgtgaaggt gcctatgatg aagcgtttag gcatgtttaa catccagcac tgtaagaagc 
9541 tgtccagctg ggtgctgctg atgaaatacc tgggcaatgc caccgccatc ttcttcctgc 
96 01 ctgatgaggg gaaactacag cacctggtaa atgaactcac ccacgatatc atcaccaagt 

35 9661 tcctggaaaa tgaagacaga aggtgattcc ccaacctgag ggtgaccaag aagctgccca 
9721 cacctcttag ccatgttggg actgaggccc atcaggactg gccagagggc tgaggagggt 
9781 gaacGccaca tccctgggtc actgctactc tgtataaact tggcttccag aatgaggcca 
9841 ccactgagtt caggcagcgc cgtccatgct ccatgaggag aacagtaccc agggtgagga 
9901 ggtaaaggtc tcgtccctgg gaactfcccca ctccagtgtg gacactgtcc cttcccaata 

40 9961 tccagtgccc aaggcaggga cagcagcacc accacacgtt ctggcagaac caaaaaggaa 
10021 cagatgggct tcctggcaaa ggcagcagtg gagtgtggag ttcaagggta gaatgtccct 
10081 ggggggacgg gggaagagcc tgtgtggcaa ggcccagaaa agcaaggttc ggaattggaa 
10141 cagccaggcc atgttcgcag aaggcttgcg tttctctgtc actttatcgg tgctgttaga 
102 01 ttgggtgtcc tgtagtaagt gatacttaaa catgagccac acattagtgt atgtgtgtgc 

45 10261 attcgtgatt atgcccatgc cctgctgatc tagttcgttt tgtacactgt aaaaccaaga 
10321 tgaaaataca aaaggtgtcg ggttcataat aggaatcgag gctggaattt ctctgttcca 
10381 tgccagcacc tcctgaggtc tctgctccag gggttgagaa agaacaaaga ggctgagagg 
10441 gtaacggatc agagagccca gagccagctg ccgctcacac cagaccctgc tcagggtggc 
105 01 attgtctccc catggaaaac cagagaggag cactcagcct ggtgtggtca ctcttctctfc 

50 10561 atccactaaa cggttgtcac tgggcactgc caccagcccc gtgtttctct gggtgtaggg 
10621 ccctggggat gttacaggct gggggccagg tgacccaaca ctacagggca agatgagaca 
10681 ggcttccagg acacctagaa tatcagagga ggtggcattt caagcttttg tgattcattc 
10741 gatgttaaca ttcfcttgact caatgtagaa gagcfcaaaag tagaacaaac caaagccgag 
10801 ttcccatctt agtgtgggtg gaggacacag gagtaagfcgg cagaaafcaat cagaaaagaa 

55 108 61 aacacttgca ctgtggtggg tcccagaaga acaagaggaa tgctgtgcca tgccttgaat 
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10921 
10981 
11041 
11101 
5 11161 
11221 
11281 
11341 
11401 

10 11461 
11521 
11581 
11641 
11701 

15 11761 
11821 
11881 
11941 
12001 

20 12061 
12121 
12X81 



ttcttttctg 
ctatgatctg 
tgacctctcc 
acgaccttgt 
tgaggctgag 
cccgtgattc 
cagctggcac 
cccaagacct 
taagttgttt 
tgaatgattt 
ctgcagttcc 
gggtccgtgt 
gacgtggagt 
aggcgcttgt 
ggagcaaggc 
agtgagcgct 
caacgtgtct 
agaaagggac 
ccgaggtcaa 
ccctcttcat 
ccctcccctc 
ctgcctgcat 



cacgacaggt 
aagagcgtcc 

srgsrgfccacag 

tgcaccatgg 
gaaggggccg 
actgacacgg 
tttcggggcc 
tctagacatt 
caaaggaatt 
cagctaaagt 
atgaatggct 
cctcaaggag 
gacgatgctc 
caggaagatg 
ctatgtgaca 
tcctgggagg 
ctgcttctct 
tgaagctgct 
gtt caacaaa 
gggaaaagtg 
catccctggc 
gtgatctgta 



ctgccagctt 
tgggtcaact 
aggaggcacc 
tatctgtagg 
agggaaacaa 
gacggtgggc 
tcccttgagg 
gggccttgat 
tctgactcct 
gacacttatt 
gcattatctt 
cttcaagccg 
ttccctgttc 
gacagagggg 
gggagggaga 
tgtccacgtg 
cccctccagg 
ggggccatgt 
ccctttgtct 
gtgaatccca 
cccctcccfcg 
aatccctggg 



acatttaccc 
gggcatcact 
cctgaagcfcc 
gaagaatgtg 
atgaagaccc 
aaacagcaaa 
ttgtgtcact 
ttatccatac 
tcgatctgtg 
ttggaaaact 
ggggtctggg 
tgtactagaa 
tgagttgtgg 
agccagcccc 
ggatgtgcag 
agccttgctc 
ccgtgcataa 
ttttagaggc 
tcttaatgat 
cccaaaaata 
gatgacatta 
atgttttctc 



aaactgtcca 
aaggtcttca 
tccaaggtga 

tgggggctgc 

aggctgagct 
gccaggcagg 
gaccctgaat 
tgacacagaa 
agatttggtg 
aaaggcgacc 
cactgtgaag 
aggagagagc 
gtgcacctga 
atcagccaaa 
ggccagggcc 
gaggcctggg 
ggctgtgctg 
catacccatg 
tgaacaaaat 
actgcctctc 
aagaagggtt 

tg 



ttactggaac 
gcaatggggc 
gatcaccctg 
agcactgtcc 
cctgaagatg 
ggctgctgtg 
ttcaactttg 
aggtttgggc 
tctgaattaa 
aatgaacaac 
gtcactgcca 
cctggaggca 
gcagggggag 
gccttgagga 
gtccaggggg 
atcagcctta 
accatcgacg 
tctatccccc 
accaagtctc 
gctcctcaac 
gage tgg tec 



25 SEQ ID NO. 33 



gi/125294, P12277 - Homo sapiens 
Creatine kinase, B chain (B-CK) 



30 mpfsnshnal 
vdnpghpyim 
Idpnyvlssr 
eaeqqqlidd 
qkggnmkevf 

35 nlgkhekfse 
ieraeqrlegg 



klrfpaedef 
tvgcvagdee 
vrtgrsirgf 
hf Ifdkpvsp 
trf ctgltqi 
vlkrlrlqkr 
qaiddlmpag 



pdlsahnnhm 
syevf kdlf d 
clpphcsrge 
lllasgmard 
etlf kskdye 
gtggvdtaav 
k 



akvltpelya 
piiedrhggy 
rraieklave 
wpdargiwhn 
fmwnphlgyi 
ggvf dvsnad 



elrakstpsg 
kpsdehktdl 
alssldgdla 
dnktf Ivwvn 
Itcpsnlgtg 
rlgf sevelv 



f tlddviqtg 
npdnlqggdd 
gryyalksmt 
eedtilrvism 
Iragvhiklp 
qmwdgvkll 



SEQ XD NO. 34 

40 

NM_001823 Hoino sapiens creatine kinase, brain (CKB) , mRNA 
Creatine kinase, B chain (B-GK) 

1 gctgttcgcc tgcgtcgctc cgggagctgc cgacggacgg agcgcccccg cccccgcccg 
45 61 gccgcccgcc cgccgccgcc atgcccttct ccaacagcca caacgcactg aagctgcgct 

121 tcccggccga ggacgagttc cccgacctga gcgcccacaa caaccacatg gccaaggtgc 
181 tgacccccga gctgtacgcg gagctgcgcg ccaagagcac gccgagcggc ttcacgctgg 
241 acgacgtcat ccagacaggc gtggacaacc cgggccaccc gtacatcatg accgtgggct 
301 gcgtggcggg cgacgaggag tcctacgaag tgttcaagga tctcttcgac cccatcatcg 
50 3 61 aggaccggca cggcggctac aagcccagcg atgagcacaa gaccgacctc aaccccgaca 
421 acctgcaggg cggcgacgac ctggacccca actacgtgct gagctcgcgg gtgcgcacgg 
481 gccgcagcat ccgtggcttc tgcctccccc cgcactgcag ccgcggggag cgccgcgcca 
541 tcgagaagct cgcggtggaa gccctgtcca gcctggacgg cgacctggcg ggccgatact 
6 01 acgcgctcaa gagcatgacg gaggcggagc agcagcagct catcgacgac cacttcctct 
55 661 tcgacaagcc cgtgtcgccc ctgctgctgg cctcgggcat ggcccgcgac tggcccgacg 
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721 cccgcggtat ctggcacaat gacaataaga ccttcctggt gtgggtcaac gaggaggacc 

781 acctgcgggt catctccatg cagaaggggg gcaacatgaa ggaggtgttc acccgcttct 

841 gcaccggcct cacccagatt gaaactctct tcaagtctaa ggactatgag ttcatgtgga 

901 accctcacct gggctacatc ctcacctgcc catccaacct gggcaccggg ctgcgggcag 

5 961 gtgtgcatat caagctgccc aacctgggca agcatgagaa gttctcggag gtgcttaagc 

1021 ggctgcgact tcagaagcga ggcacaggcg gtgtggacac ggctgcggtg ggcggggtct 

1081 tcgacgtctc caacgctgao cgcctgggct tctcagaggt ggagctggtg cagatggtgg 

1141 tggacggagt gaagctgctc abcgagatgg agcagcggct ggagcagggc caggccatcg 

12 01 acgacctcat gcctgcccag aaatgaagcc cggcccacac ccgacaccag ccctgctgct 
10 1261 tcctaactta ttgcctgggc agtgcccacc atgcacccct gatgttcgcc gtctggcgag 

1321 cccttagcct tgctgtagag acttccgtca cccttggtag agtttatttt tttgatggct 

13 81 aagatactgc tgatgctgaa ataaactagg gttttggcct gcctgcgtct g 



15 SEQ ID NO. 35 

X15334 

Human gene for creatine kinase B (EC 2.7.3.2) . 

20 1 gatcagtttt tttttttaat cgcacttatg cttattgttt attagcgttt cctcccatct 

61 ttgcctgaag tctccgggga ctgcctttgg gggtcgggta aacttgtccc ctgcgaagag 
121 ggcccagggt tggggtctgg aaactccgag gctgcacttg ccagcggcct cttaaggcca 
181 cagcgtGGcc gtggtfctctg gctcgcagcc ccccgagacc caggacttgt ccaaggtcag 
241 ggcaccgcgg gtgcccccgg gctgggccgc agcagactgc gcttcccgcg cgccttcgct 
25 301 ttgcaccagg atcgcccagg aaatgcctgc gggcaccttg aggaaggtcg gcggctccgg 
3 61 gccagctcgc actggccggg gtggggcggg ggccgtacct gcfcgcggaag ccccgaaagc 
421 tttcgcccgg cccctcgccg ccgccgcggg ggctggctgg actaggcggg caggctcgag 
481 gatgcggatg aacccaagcg tcctcgagtg cccggaggct ctccgcctca gtttcccgcc 
541 cagaggcaag ggcgtgcgag gggatccaga tatccaagga cctgaggttt cggcctcgag 
30 601 gtcttgggcg ggggactggg caggctgcgc ggggtcccag cgaggggaca gctcgggtgg 
661 gcggccaggg tgttgggggc tgcgggcggc ggacaaagcg gcggcaccac cccgcggcgc 
721 gggccaatgg aatgaatggg ctataaatag ccgccaatgg gcggcccgcg ttgtgcccct 
781 taagagccgc gggagcgcgg agcggccgct gttcgcctgc gtcgctccgg gagctgccga 
841 cggacggagc gcccccgccc ccgcccggcc gcccggtgag tgggcccggg ggccgggggc 
35 901 gtccgcgccc gggctagggg cgctgcgagc aaagggggcg cgtcgcctgg agcgcgcgcc 
961 ggaccggccg ggggt ccccg gcgatgatgg cgctccccgc gcgcgctgcg gaccccgctg 
1021 accttggccg cgtcccgggg ggcgccgggg ggcccggcgg cgggggcctg agtggtacgc 
1081 gggagcccgg gaaccccggc gtgccggtcc cctctgaccc cgcgtctccc cgcagcccgc 
1141 cgccgccatg cccttctcca acagccacaa cgcactgaag ctgcgcttcc cggccgagga 
40 1201 cgagttcccc gacctgagcg cccacaacaa ccacatggcc aaggtgctga cccccgagct 
1261 gtacgcggag ctgcgcgcca agagcacgcc gagcggcttc acgctggacg acgtcatcca 
1321 gacaggcgtg gacaacccgg gtacgcgacc cctcggggcc ggggtcccgg ccccccctcc 
1381 ccccgcgcag ccgcagggtc ctcagcagcg cgctcgggcc cggcagtgac gtcactgtcc 
1441 ccgtcccgcg ccccctcccc caggccaccc gtacatcatg accgtgggct gcgtggcggg 
45 1501 cgacgaggag tcctacgaag tgttcaagga tctcttcgac cccatcatcg aggaccggca 
1561 cggcggctac aagcccagcg atgagcacaa gaccgacctc aaccccgaca acctgcaggt 
1621 gcggggctgc gggcgggccg ggcgggcggg gccggggtct tcgggcgctc actcccgtct 
1681 cgcctcccag ggcggcgacg acctggaccc caactacgtg ctgagctcgc gggtgcgcac 
1741 gggccgcagc atccgtggct tctgcctccc cccgcactgc agccgcgggg agcgccgagc 
50 1801 catcgagaag ctcgcggtgg aaggtagggg ccgggcgggc cgaggggcgg cggcggccgc 
1861 gtcccGctcc cggcgcggtc cccgcccgct tttgtttacg tcgcccggga gcggcagccg 
1921 ccgtcgcgct cttatctgcg cgcgcccggg ttcagtttcc cggacccacc gagggacgga 
1981 ggcccagccc ccgcgcccac agcggcctgg ggcccaggga gggcgggtcc tggcgcgggg 
2 041 tcaccgcctg ggaccgtcgc ccgggccgtg aggactggac gcccgcagat ccgggcgggfc 
55 2101 ggggccctct gaagtccGcc gaggtggggc acgggggcgg gcgggtccgc gctgcgggct 
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2161 ggaggggcgg gcgcgggagc ccagcgtcct gagcgcaccc ctcgcagccc tgtccagcct 

2221 ggacggcgac ctggcgggcc gatactacgc gctcaagagc atgacggagg cggagcagca 

22 81 gcagctcatc gacgaccact tccfccttcga caagcccgtg tcgcccctgc tgctggcctc 

2341 gggcatggcc cgcgactggc ccgacgcccg cggtatctgg tgcgtgtccc tctgcgccct 

5 2401 ctcgcggcgt cctccctccc cgctacctcc gctttccctc tcgcccccct cgcgggggtg 

2461 gggcccctcg cggcgaggag gaggaggagg aggagggagg ggccggccgc gctccgggtc 

2521 tgggttccgt gccgcgcctc ctcctgcgcc ggtgaccttg gccgagcagg tgcgttaagg 

25 81 gactgggccc cggcccgtgg gggctcagga ctcagcaaca cctccccacc ccgagacgtg 

2641 aggtgggggc ggggctctct ggcgcctctc cccgacggcc ctgggagctg gagctctttg 

10 27 01 ttttcttttc tcactcctcc gccgctggga ttctaccagg ggctggtgac gccaaagctt 

2761 ctccaggggc agggctccta cccccactgt ggggggcggg tcgggctgtc ctggcggtcc 

2821 ctggccccgc cccacctcgg gccacagcgc atgatggcag ctggggttct cctgctgtga 

2881 ggcgtcccgg ttcccccgcc cgccccgtgt tggcgggtgg agtcttggca gcagcctcca 

2941 ctcctgggca tggcagggag cagcacctca gggacttggg aagttccttt ggtctggggg 

15 3 001 cggcctgggg cttttttctg ggtatgccct gagaccagcc ctcccgcagg cacaatgaca 

3 061 ataagacctt cctggtgtgg gtcaacgagg aggaccacct gcgggtcatc tccatgcaga 

3121 aggggggcaa catgaaggag gtgttcaccc gcttctgcac cggcctcacc caggtgccag 

3181 ggacggggca ggcccagacc ccagggcccc agcagggatg tgggtgcccc agcatcagtc 

3241 cccccggggg atttccggca ctggggagtc tcagggcctg taggggtttc aggcaggcct 

20 33 01 tctccctcat accctcttct ccgtctgcag attgaaactc tcttcaagtc taaggactat 

3 3 61 gagfctcatgt ggaaccctca cctgggctac atcctcacct gcccatccaa cctgggcacc 

3421 gggctgcggg caggtgtgca tatcaagctg cccaacctgg gcaagcatga gaagttctcg 

3481 gaggtgctta agcggctgcg acttcagaag cgaggcacag gtgagcaggg caggt gctgc 

3541 ggcttcccgt ggcctttggg cagccctgtt tcctccgccc tgacttgctg tctccccagg 

25 3 601 cggtgtggac acggctgcgg tgggcggggt cttcgacgtc tccaacgctg accgcctggg 

3 661 cttcfccagag gtggagctgg tgcagatggt ggtggacgga gtgaagctgc tcatcgagat 

3 721 ggaacagcgg ctggagcagg gccaggccat cgacgacctc atgcctgccc agaaatgaag 

3 781 cccggcccac acccgacacc agccctgctg cttcctaact tattgcctgg gcagtgccca 

3 841 ccatgcaccc ctgatgttcg ccgtctggcg agcccttagc cttgctgtag agacttccgt 
30 3901 cacccttggt agagtttatt tttttgatgg ctaagatact gctgatgctg aaataaacta 

3961 gggttttggc ctgcctgcgt ctgagtggtg cctctccttt cccagggggg agggggaagg 

4 021 gcagcagcca ggccccagga gtcttgagtc ctgggcctgc tgtgggcctc gccttctgtg 
4081 agatgggaca agagccagga ggtggccact ctgttctgcc tgccctacct agtccatggg 
4141 ccccttccct cgtgtctatc gggctgtgca ggcaggaaca tgggagagag cgagggagga 

35 



SEQ ID NO. 36 



P14618 - Homo sapiens 
40 Pyruvate kinase Ml or M2 isozyme 



mskpb.se agt 
Iketniksgmn 
tglikgsgta 

45 islqvkgkga 
£asf irkasd 
ipaekvf laq 
Isgetakgdy 
f kccsgaiiv 

50 awaedvdlrv 



af iqtqqliia 
varlnf shgt 
evelkkgatl 
df Ivteveng 
vhe vr kvl ge 
kmmigrcnra 
pleavrmqhl 
Itksgrsaliq 
n f amnvgkar 



amadtf lebm 
heyhaetikn 
kitldnayme 
gslgskkgvn 
kgknikiisk 
gkpvicatqm 
iareaeaaiy 
varyrprapi 
gf f kkgdvvi 



crldidsppi 
vrtatesf as 
kcdenilwld 
Ipgaavdlpa 
ienhegvrrf 
lesmikkprp 
hlqlf eelrr 
iavtrnpqta 
vltgwrpgsg 



tarntgiict 
dpilyrpvav 
yknickwev 
vsekdiqdlk 
deileasdgi 
traegsdvan 
lapit sdpte 
rqahlyrgif 
f tntmrwpv 



igpasr svet 
aldtkgpeir 
gskiyvddgl 
fgveqdvdmv 
mvargdlgi e 
avldgadcim 
atavgaveas 
pvl ckdpvqe 

P 
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10 



20 
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H. sapiens M gene for Ml-type and M2-type pyruvate kinase 

1 ggtcttcaca ttttgaatgc gcaacattgt atctgtgaat gaaggcaaga gfctaacagct 
61 gtttaattga taactgctcg catcattagt tgctggctaa caactgggaa atcagaaaat 
121 gtcttgtaga aaaatgtaag aaaagttcca acaatactga cttaaacacg agcaaaggtg 
181 aaaacagaaa tgctgactcc tgcataggtt atcggcccta atgtfcctgac ttgatatttc 
241 cagatgccca gctctgcgct aatatcaaca ccgtctattt actttctact ctgaggcatt 
301 cgctctgcag gattccagac cctactaaat tattcacatg gccccaaccg gtccttcctt 
361 gttccgcggt cctaacacaa tgaatggtcc taagaggaaa acggcctcgg ctcccgctcc 
421 aggcccactt cgcagtccct agttctccct actgccgctc cagtgccaga gcccctccga 
481 aggcggccag gacctccaac cacgcacaag tctgcagctc tccccaactt tccgttcagc 
541 tcagtctccg agggtgcgcc agagcagaca cccggaggag tggggagtgg cagggcgggg 
601 ccgggagaat gctgccccgg aacccataaa ttcggccctg cccaggtagg ccgggacagc 
661 tggggtggcc tgggccgaga gccaagaaaa gagaccccat ctggacgccc aacttggcgg 
15 721 caacaggtgg ccggcgcccg ggggtctggg aggaaagtcg ctccgggcgg gccccgttgc 

781 cccgccgcgt ccccattggt catcaggttt cttaaaatgt gactctgaat ctgtgtcctt 
841 ccgccgcaga atttagtccc accgaaaggg caacctgccc gcgcgttccg ccaccgccgc 
901 cgcgcttcct cctgaaggtg actcgagccc gcggggacgc agggggcggg gcccgggtcg 
961 cccggagccg ggattgggca gagggcgggg cggcggaggg attgcggcgg cccgcagcgg 
1021 gataaccttg aggctgaggc agtggctcct tgcacagcag ctgcacgcgc cgtggctccg 
1081 gatctcttcg tctttgcagc gtagcccgag tcggtcagca gccggaggtg agcggtgcag 
1141 gcagtacgcG atcagtcccc accaagggcc agtcgcccgg ctagtgcgga atcccggcgc 

12 01 gccggccggc cccgggcacg caggcagggc ggcgcaggat ccctgtgcta aatggtatat 
1261 taaccacttc tcagtcttac cactctcttt caatttgtct cgacccagga cctcagcagc 

25 1321 catgtcgaag ccccatagtg aagccgggac tgccttcatt cagacccagc agctgcacgc 

13 81 agccatggct gacacattcc tggagcacat gtgccgcctg gacattgatt caccacccat 
1441 cacagcccgg aacactggca tcatctgtac cattggtgag tgggtgtccc ccttccccca 
15 01 aaaagggctt catgggcagt gacctttctc tcctgaaaag agctccatgc actttttaaa 
1561 gacttttgag ctatttggga gaggaaaaat tttcagggaa aaaaattctt taaacttaaa 

30 1621 gcaaacttaa atgtttttcc ttggttgaat aattaatact tgtggcttta aaacttttcc 

1681 taataggccc agcttcccga tcagtggaga cgttgaagga gatgattaag tctggaatga 

1741 atgtggctcg tctgaacttc tctcatggaa ctcatgaggt gagctgtggc tggaccctat 

1801 cctggcaggg gaattggagc tggattctag tgtgggagca cgcttgtcat cttccttctt 

1861 ttcccccagt accatgcgga gaccatcaag aatgtgcgca cagccacgga aagctttgct 

35 1921 tctgacccca tcctctaccg gcccgttgct gtggctctag acactaaagg acctgagatc 

1981 cgaactgggc tcatcaaggg cgtgagtatt ctgcggagag cgaggggaag gctcagtagg 

2041 caatatgccc cagagacatg attccttccg aggtgatgct gctactggtg tctccagttt 

2101 ggactcttCG ttactctctt gtccctagag cggcactgca gaggtggagc tgaagaaggg 

2161 agccactctc aaaatcacgc tggataacgc ctacatggaa aagtgtgacg agaacatcct 

40 2221 gtggctggac tacaagaaca tctgcaaggt ggtggaagtg ggcagcaaga tctacgtgga 

2281 tgatgggctt atttctctcc aggtgaagca gaaaggtacg tatgggagct ggagtccagt 

2341 tgtctaaaac agtcttttgt ctctaaactt ctcgtctctg cctccccaac ttaccctttt 

2401 ttatacaggt gccgacttcc tggtgacgga ggtggaaaat ggtggctcct tgggcagcaa 

2461 gaagggtgtg aaccttcctg gggctgctgt ggacttgcct gctgtgtcgg agaaggacat 

45 2521 ccaggatctg aagtttgggg tcgagcagga tgttgatatg gtgtttgcgt cattcatccg 

25 81 caaggcatct gatgtccatg aagttaggaa ggtcctggga gagaagggaa agaacafccaa 

2641 gattatcagc aaaatcgaga atcatgaggg ggttcggagg caagtccccg ttgtccctgg 

2701 tctactgcca tacttgtggc ctctgttcta tataacctct ctccccccca ctttgtccat 

2761 caggtttgat gaaatcctgg aggccagtga tgggatcatg gtggctcgtg gtgatctagg 

50 2821 cattgagatt cctgcagaga aggtcttcct tgctcagaag atgatgattg gacggtgcaa 

2881 ccgagctggg aagcctgtca tctgtgctac tcaggcatgt gcccaccctt ccccacattc 

2941 tcatgtgcac actcgcatgt ttgtatggga aagctctgga ggctgtctga tctcttccca 

3 001 tggaattgtc gcaacgtaac acacagataa tccccttccc ccatgtacct acacaaagcc 

3 061 atactctgtg tacctactca ctatccagag gatcagcttg ctgtcatttg tctctgaaga 

55 3121 cagctcaagc tacatctcac taatgctctg tcccctccca gatgctggag agcatgatca 
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3181 agaagccccc gcccactcgg gctgaaggca gtgatgtggc caatgcagtc ctggatggag 
3241 ccgactgcat catgctgtct ggagaaacag ccaaagggga ctatcctctg gaggctgtgc 
33 01 gcatgcagca cctggtgagfc tcfcgggcctg ccccatcccc cagggcttcg gactgggcct 
3361 gggatggatg caagctctgg tgcagagctt tttaggtttc tccatcctct tatgcacagc 
5 3421 ctttcattat cctccaagtt acagcagcaa gagggtgggg gtggaagtgg aggtggcttt 
3481 ttttttttct cctgttcctg cattcctgcc cacaccccca cccctctcat ttccttctgc 
3541 tctggaggca cctccttcat tggacaccac acagtttatt tcacttctga cttcaaggtt 
3 6 01 gtgaattctt cccatggctt aagtcctggg atacttctgc agfcgaaagga ggtcttgtac 
3 661 ctcttcctca gagtcagaag ttctgagtac ctttgcccta ttctgaaaag ggctaggggc 
10 3721 tcctgctccc agctgcccfcc ttcctttggc ttccaattca gttccctctg ccccgcatcc 
3 731 tgcagacagg cgctcccgca gggggccctt gtggacctgc actggagtct gttgccttca 
3841 ctgagctgcc tgtgctggcc ttgcatggtg cctgtagggg gatttgcttt gctgtgccat 
3901 tggggtacag ctgctgctct tactctagac caaaaagtcg ggttgagtga ctggtggcag 

3 961 ggccaagata gagacagcgg ggagggtggc tgaccctggc ggccctggac tgagcgtctg 
15 4021 gaggagtcgt ggaggctctt tcccttcttt ctcctctgag agctcgttct tcaggctctt 

4081 ccagcttgtc atgtcgagtg cctggccact gctcagggtt ggaggctcag tccctttgcc 
4141 ctgtctgttc cagctctgga gctaactcag ggatccctga tcagggttac gtaggtttgg 

42 01 taaaatgagt gctggaaatt aactttctcc cagtagtctt aggtctagct cagtgaactt 
4261 aaactttatc cagatatggt ttttccttca gcctttctat tccctttcta gccagtgaaa 

20 4321 gacccgctgc cctttgacct cagccccctc caagccccca agtttaaaac gccaccccct 

43 81 gccaccagaa aaaacagaaa aaaaaaaaaa aaaaaaaact aaaacaccca tctggtctgg 
4441 gcatcttcct tcctttttca cfcatgfcatcc tgttactggg cttaaacagc tttcagagaa 
4501 gagatgtcat ttctattaaa tgctctttca gtagcgaact gagttcacac ttgactaagg 
4561 atattttccg gactgtctgt catcagcatc cttagtgggt ttccccatat ttaaattggt 

25 4621 agaggccagg gatggtggct cacacctgta atctcagtac tttgggaggc caaggtaggt 
4681 ggattgcttg agctcagaag accagcctgg gcaacctggt gaaaccctgt ctctactaaa 

4 741 aattcaagtt agctagctgg gcatggtgat gcacttctgt agtcccagct acttggagag 
4801 ggggtggtgc tggggcagca ggatcgctta aacccaggag gttaaggttg cagtcagcca 
4 861 agatggtacc agcctaggtg acaaagtgac accctgtctc aaaaaagaaa ccaaacaaac 

30 4921 ataaaaaaaa aaacaaaaaa atcggtagag agtgatttct ctcccaggcc cacbtaatgt 
4981 agactgggcc tggctgacac ctcaccattc gtgtgatgtg attgctgttc tgatgcttag 
5041 atactcttgg cgcagtctca caattgccac catggtagga aggtgtccag gagacggtgc 
5101 accttgaacG agtcaccact aaagtggctg cctttctggg tctctccaca catcccctct 
5161 ctctaatttc cctacttaat cgtgtgaatt catggtctca aaggaggaac agaggctgat 

35 5221 cttgacttag atatactgaa ccatgaaatc actgcataga atgtggggac ttgaatgtgt 
5281 ctttgggcaa gtcatttaac cfccttaagac ctcatctgta aaatggatta gatatgttta 
5341 attatagcct tagcattaaa tattcattgc tgttattatt aagtgtctga taagtctctg 
5401 tgtacatgga tgtaatcttc ctaactccca ttacctccat ttatagatga gggttatatg 
5461 gccaataaag cctgggtttg aatctaggtc tactgcctcc aaagccagtc ttctctcctg 

40 5521 caacatcatg ctctgtctag caggagatga gaacaggtct ccatttggag cctgtcagtg 
5581 gggtcagaga ctaagattca ggctcagggt ctaaattccg tatcctttct tccataccct 
5641 ggtgtttcct atgaacagat agatacttta gggctgcaag gtttggattg catggcactg 
5701 ctcagaagat aagttacagg tctgggctag gctgtagctg cccctccagg tggctagacc 
5761 tttcctttct gtgtcaccag ttaacactgg ccaacagttc cttccattaa ctgttcactg 

45 5 821 ctttctcctg tgtctaactg atgcagttta tgacccataa ctaagagcag taccaggtat 
5881 ggctctgttt cctgttcatg tcccctgtcc tctgggctgc atgcattccg ttcttacaga 
5941 aagaatacct ttaacctagt acatcctgcc acacatctgc ttctactgtg aaattgatga 
6001 gggggtatta ccgattcttc cctctcccat catttactga gatgctggtg attgcattat 
6061 aatcctctta agcttacatt gtctttctga ttcttggtct tatctgagca agtgatctat 

50 6121 aaataactca gfcggcttfcct catgactgtt ttaatfcatta gafctfcfcaatc aagtgtctta 
6181 ttaaatatat ctgcatgctt ccacaggcat ctgtctcttc acatggctgt tcagtgtgcc 
6241 tctcacaact tagcccaaac tcagttgagc tgccttgctt tggctttgac ccagctttcc 
6301 agcgctgctc aatctgttgc catggcaggc cattggaaag gctcagttca tccccgtgcc 
6361 tgaagccaag tgagcgctca ctccatgcat gcatggaggc tgggcaggag cctgcctaafc 

55 6421 caaccagcca tgtgaggagg gagggcctgt tccttcctgt aagctatgtc atgaggcagc 
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6481 gtggtcaagt cctctgccag ggagtggcct ggcccagcct gggcatgttt tcatgccagg 
6541 gtgctagagc ctactgccag attgtctccc tccaccccca atgaaaaaat ccttccagaa 
6601 gggaagagcc aatttcccct gtattggagg ggaagtggca gcacctcctg aagcagttgg 
6661 actttcatca ccctacctct gcatctgcct gaaggacaga tttagccaat taacctaagg 
5 6721 ttaccttcct ctctgataaa ttccccattc tgtcttccca tgtgttgtgt ctcgtttttt 
6781 tcctcctcct tccctcttcc ttgccccctc ttcccctaaa ccfctacagat agctcgtgag 
6841 gctgaggcag ccatgttcca ccgcaagctg tttgaagaac ttgtgcgagc ctcaagtcac 
6901 tccacagacc tcatggaagc catggccatg ggcagcgtgg aggcttctta taagtgttta 
6961 gcagcagctt tgatagttct gacggagtct ggcaggtagg gccctaaggg caggtaacac 
10 7 021 tgttaggata accagcctct tgctgcacct gccccaggag aagagagaag gcccaacctg 
7081 gcatctggga acagagcctc ttctcgtctg taggaacacc gccagggagg tcatggcagg 
7141 gcagaccaaa gggtcctgtg gctcagtagg cacagtagat gtcacaggca cttggtgaag 

72 01 gactggtttc tgtggagtct tgatcttggc tcagctcaga atctccagtg attgggctcc 
7261 tcttggcctfc tgttcccagg aacatgttcc tcaccagctg tccggtgact cttcccctcc 

15 73 21 ctctcctttt gtgacaaagc tctgacaaag ctctgtcccc ctctcgtccc tctggacgga 

73 81 tgttgctccc ctagattgcc cgtgaggcag aggctgccat ctaccacttg caattatttg 
7441 aggaactccg ccgcctggcg cccattacca gcgaccccac agaagccacc gccgtgggtg 
75 01 ccgtggaggc ctccttcaag tgctgcagtg gggccataat cgtcctcacc aagtctggca 
75 61 ggtaggaggc ggcagcggct ccctggaatg ccctgctcag tggtacctca ccttgggggt 

20 7621 cctgggagca gtccattgaa caatgctcag gtggcactga gccaaggtaa gacccctctg 
7681 cctgccacct tgggcctgca gggaaggatt gagcagagcc ccttcccagg gcccaaagga 

7 741 ctctaggtag cactcataag gaatgtcaga acatttggat caaaagcaaa tttatgctgg 
7801 agatttatta cataacagtg cacaggctga ctacaaatgg ttatttgata ttgaaaattt 
7861 agtcctctaa aattgtaaaa gataccactt ttgcttattc cagttactat gtgctcttta 

25 7921 aaaatttcag ttgggaaatg aatttattta aatgctgttt actgtgcctc catttggcac 
7981 actagtccct gctgtfctttg agccctaaag acaaattggg ttccagctca ggagaggttg 

8 041 ctgtgctatc ttggctgaca ttctgtgggc ctggcagcca ggctgaggac tgtgtggcct 
8101 atgctgggcc tccaacttgg gatcccttcc ttggcccagg acattgagtt aatgtccttc 
8161 actctcctag ttagggagta tgctccttgt ccctgtccac aggggagcaa gggtttcctg 

30 8221 gaagagggga gcaaacaggc agtgcccatg cactgaggag cagcagatgg gcgtgggcag 
8281 GGcagagaac caggacacaa gctctgtgca gatccctcag cagagggctc cagcctccca 
8341 ctcttggctg aacagctcca acccgtaggg ttgacctttc ttaaaaggtc cagttcttgc 
84 01 tgtttggcta ttttaagctc tagtcttctg gggtttcact cagctggtcc tggcttcagc 
8461 aattgcttcc ctctgaaggc cttgcataga ggccaagcgt gaagtgcagg gacttctctg 

35 8521 ctgtgatgtg gcttaagttt ccctgacacc tgttgagtgt cctcataact tcccttctgg 
8581 tgcccctccc cagctcctga gaccagctgc agctacaagt gtgcagtgtc agtgttcaag 
8641 aaagtgcctg gcagaggggc tttagaaggg tcccctgcct tccaaaggag ctttggcagg 
8701 cagacgtgct cctgcagcaa cactcccatt tcctgttctt gcctgctgag tagcacctag 
8 761 atttctaagc ctcatctaga tactcagatt tgattctggg cctttatagc ccagttgctg 

40 8821 ggactgtttc aggagctagg ggccatgtgg ggcagggaga gggcacaaaa gtagagaagc 
8881 ctgatgttga ttcccagggg gctggtcagc tctgctactg ctccttgcag atgtcaagag 
8941 tcaggtgcta gtcacgtgct gcttggcttg tcactgtcat tggcagcgag aggaatgggt 
9001 gctggtgaca ttgggccagg gctgcctctc tgtgtcagag ttcagggtgt aggaggggtt 
9061 ctgccaacca tgggctgtgt ggggtaagtg ggttgaggct gatctttctg ggtcaaggtg 

45 9121 atcctgagcc cttgcctgtg gaatgggggt agagggcaat ggtaacctag ctagcatgct 
9181 gtgggggata taggatgagg ggctgcccga ccctcgggag gggtcctagg gagcagatgt 
9241 tgaagaggcc agagccctca gtgagctgga tgagggggtg agccgtttga actccctgag 
9301 ggtacttcct ggggcctcgt gtaatggtct cttctgtatg tcccccatcc catctcaggt 
9361 ctgctcacca ggtggccaga taccgcccac gtgcccccat cattgctgtg acccggaatc 

50 9421 cccagacagc tcgtcaggcc cacctgtacc gtggcatctt ccctgtgctg tgcaaggacc 
94 81 cagtccagga ggcctgggct gaggacgtgg acctccgggt gaactttgcc atgaatgttg 
9541 gtacgtggct ggagcagggg ctagagccta gaggagcttg gggatgcttg agcattggcc 
9601 accaacctcc cttctcttcc tccaggcaag gcccgaggct tcttcaagaa gggagatgtg 
9661 gtcattgtgc tgaccggatg gcgccctggc tccggcttca ccaacaccat gcgtgttgtt 

55 9721 cctgtgccgt gatggacccc agagcccctc ctccagcccc tgtcccaccc ccttccccca 
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9781 gcccatccat taggccagca acgcttgtag aactcactct gggctgtaac gtggcactgg 
9841 taggttggga caccagggaa gaagatcaac gcctcactga aacatggctg tgtttgcagc 
9901 ctgctctagt gggacagccc agagcctggc fcgccccatca tgtggcccca cccaatcaag 
9961 ggaagaagga ggaatgctgg actggaggcc cctggagcca gatggcaaga gggtgacagc 
10021 ttcctttcct gtgtgtactc tgtccagttc ctttagaaaa aatggatgcc cagaggactc 
10081 ccaaccctgg cttggggtca agaaacagcc agcaagagtt aggggtcctt agggcactgg 
10141 gctgttgttc cattgaagcc gactctggcc ctggccctta cttgcttctc tagctctcta 
10201 ggcctctcca gtttgcacct gtccccaccc tccactcagc tgtcctgcag caaacactcc 
10261 acGCtccacc ttccatttcc cccactactg cagcacctcc aggcctgttg ctatagagcc 
10321 tacctgtatg taataaacaa cagctgaagc acctgtttcc tctctttt 



SEQ XD NO. 38 



15 Q01995 - Homo sapiens 
Trans gel in 

mankgpsygm srevqskiek kydeeleerl 
ilsklvnsly pdgskpvkvp enppsmvfkq 
20 kdmaavqrtl malgslavtk ndghyrgdpn 
snrgasqagm tgygrprqii s 



vewiivqcgp dvgrpdrgrl gfqvwlkngv 
meqvaqflka aedygviktd mfqtvdlfeg 
wfnikkaqehk reftesqlqe gkhviglqmg 



SEQ IB NO. 3 9 

25 

D84342 

Homo sapiens DNA for SM22 alpha, complete cds 



30 



1 ccgggtgaaa gcagagtgct ccctgaccct ctgcccctcc ctcctccacc ctggcctgct 
61 ttagctttcc ccagacatgg ccaacaaggg tccttcctat ggcatgagcc gcgaagtgca 
121 gtccaaaatc gagaagaagt atgacgagga gctggaggag cggctggtgg agtggatcat 
181 agtgcagtgt ggccctgatg tgggccgccc agaccgtggg cgcttgggct tccaggtctg 
241 gctgaagaat ggcgtggtga gtggcaccct gggctagggc gctggggggc tggggtgtga 
3 01 ccccctgtga gtcctgggcc aatccctgag gactgctaag ctgcgtccta tgccctatgc 
35 361 ctggtagafct ctgagcaagc tggtgaacag cctgtaccct gatggctdca agccggtgaa 
421 ggtgcccgag aacccaccct ccatggtctt caagcagatg gagcaggtgg ctcagttcct 
481 gaaggcggct gaggactatg gggtcatcaa gactgacatg ttccagactg ttgacctctt 
541 tgaaggtaga gaggagaatg ctgggggagg aggtgggcag gaggacaggg tgctgggaca 
601 gggagagggt atgaccaaat atgccacaac taggggtgtg ctcgcccgca cacagcaggg 
40 661 atgggatatg ccgagaataa cacgccacgc tcacagggcc cactgagagg cctcccttga 
721 attggggaca actcttggcc ctggtttggc catttttttg tgagagacgg gggcaggccc 
781 tggcttggag tcttgtttat acgttcttga tgttcatctc ctctctcctg tcttctcaca 
841 ggcaaagaca tggcagcagt gcagaggacc ctgatggctt tgggcagctt ggcagtgacc 
. 9 01 aagaatgatg ggcactaccg tggagatccc aactggtfcta tgaagtatgt ggcccccagg 
45 961 gagcttgagt ctccgcatgg ggtgggaggt ggcttgttct aaggagcttg cgggaaggat 
1021 taggggaagc agatagccaa gaaaggataa agtgagggtc tgggatgggg aataatgggt 
1081 ccttaatact ccttgacccc tccctttcca ccctcctgcg ctcagtctcc ctagcctatg 
1141 aggcaagcta gattagggaa aaaaagtgca acaggaaggc aatgggattg ggctaggacg 
1201 taacagaggg atcagaaaac gggtggaaaa cacacagttc taccaagtct ttatcctgct 
50 1261 tcctcctctt ctaggaaagc gcaggagcat aagagggaat tcacagagag ccagctgcag 
1321 gagggaaagc atgtcattgg ccttcagatg ggcagcaaca gaggggcctc ccaggccggc 
13 81 atgacaggct acggacgacc tcggcagatc atcagttaga gcggagaggg ctagccctga 
1441 gcccggccct cccccagctc cttggctgca gccatcccgc ttagcctgcc tcacccacac 
1501 ccgtgtggta ccttcagccc tggccaagct ttgaggctcfc gtcactgagc aatggfcaact 
55 1561 gcacctgggc agctcctccc tgfcgccccca gcctcagccc aacttcttac ccgaaagcat 
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1621 cactgccttg gcccctccct cccggctgcc cccatcacct ctactgtctc ctccctgggc 
1681 taagcagggg agaagcgggc tgggggfcagc ctggatgtgg gccaagtcca ctgtcctcct 
1741 tggcggcaaa agcccattga agaagaacca gcccagcctg ccccctatct tgtcctggaa 
18 01 tatttttggg gttggaactc tc 



SEQ ID NO- 40 

Q14103 - human 

Heterogeneous nuclear ribonucleoprotein 



mseegfggdg aaaaataavg gsageqegam vaatqgaaaa agsgagtggg tasggteggs 
aesegakida skneedeghs nssprhseaa taqreewkmf igglswdttk kdlkdyfskf 
gewdctlkl dpitgrsrgf gfvlfkeses vdkvmdqkeh klngkvidpk rakamktkep 
15 vkkifvggls pdtpeekire yfggfgeves ielpmdnktn krrgfcfitf keeepvkkim 
ekkyhnvgls kceikvamsk eqyqqqqqwg srggfagrar grgggpsqnw nqgysnywnq 
gygnygynsq gyggyggydy tgynnyygyg dysnqqsgyg kvsrrgghqn sykpy 



30 



20 SEQ ID NO. 41 

AF026126 

Homo sapiens heterogeneous nuclear ribonucleoprotein D (HNRPD) 
gene, complete cds 

25 

1 tcgcagaggt gcagccacac cccggcctaa cgtgttgttc cccccgatac tggagtggtg 
61 gggagggtga gtggactcca ggaatcctcg gaagggcggg ggcggaggca gggggcccct 
121 ctagccgcta cttcgaaaca gcattccttg ttctcgatgg tccccgcgcg actgtcttag 
181 ctcacgacac ttccggttcc ttttaaaggc ccccaaggct gtgcaacgcg gagcgtgaga 
241 ggaaggtata aagggtagcg agagggcggg accgaggagg aaagggaaaa aaaaaaaact 
3 01 agggggatag gggtgggggg acgcgcgaag ggcgcgctct cgcgtcacgt gaccgggacg 

3 61 cgccgttctt ccgtcggcca ttttaggtgg tccgcggcgg cgccattaaa gcgaggagga 
421 ggcgagagcg gccgccgctg gtgcttattc ttttttagtg cagcgggaga gagcgggagt 

4 81 gtgcgccgcg cgagagtggg aggcgaaggg ggcaggccag ggagaggcgc aggagccttt 
35 541 gcagccacgc gcgcgccttc cctgtcttgt gtgcttcgcg aggtagagcg ggcgcgcggc 

601 agcgcgggga ttactttgct gctagtttcg gttcgcggca ggcgggtgta gtctcggcgg 
661 cagcggcgga gacactagca ctatgtcgga ggagcagttc ggcggggacg gggcggcggc 
721 agcggcaacg gcggcggtag gcggctcggc gggcgagcag gagggagcca tggtggcggc 
781 gacacagggg gcagcggcgg cggcgggaag cggagccggg accgggggcg gaaccgcgtc 
841 tggaggcacc gaagggggca gcgccgagtc ggagggggcg aagattgacg ccagtaagaa 
901 cgaggaggat gaagggtgag taaggggcat cccaggatag tcaggcccaa ctagtccccc 
961 tccccctctt tattccGccg ccattagcgc tggttcccgc tctcagcccg ttctccgggc 
1021 cccatgcagc ctccttgcac tggtctccct ttttctatgt tgggttcccc ggaccttcat 
1081 ttccttcttc ctttgcctgc ttatcgcccc cctcccccat cacacacgtt tccaccttta 
1141 gccgtcacca tgctgctcct cggcccgccc tcctttcctc cctcgccatg ggtctcctcc 
12 01 caccgactta gccgccagat tttttcccgc ctgtcgtggg gtaccttttt fcctttccatg 
1261 ctgtcccGtc tttttccctt tttctacagt cttgggcata aaaacacaga cacaaacagc 
1321 ttctctgttt gtattactaa ggtttatttg gtgcttctcc accatcctga aacgatgcga 
1381 ttgtttataa gcacatgttt tggggaacgc gtaggctgtc cacctctgcc tctcccttgt 
1441 cggccttacg ccgactgttt tcttgggatt ttgaataagt ttcgcctaag gatttactca 
1501 ttttctccac catacgctat cgcacaatgg catactcata caggccctac attttgacat 
1561 gcagaccaaa ttgggtctgg tgaaatgctc cgagtttctt gttggtacat tggttttgct 
1621 ccgcgggctc tggttaagtt cttagtcgat cgggcctgca cttgactgga gctgttccta 
1681 tctccggcct agcatctacc cttcccccac cccactgagt tattctaacc gcgcaccctt 
55 1741 ttcgcgccct cagctattgg gttccccaac tgttagtaca gattgtacct tactttttat 
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1801 gtgcaacGta attttttaca acccccagcc cccttttttt cccggtgccc aggatcctat 
1861 tttggtgtct tgatatctgt ttcccgccca ctaagggagg cctgtagtcc tcttaagaga 
1921 aacaaatcac tgtattgtgc tatgggatac tttttttttc tttttggtgc aaatttctta 
1981 gtattaactt gcttcctcca attgaaataa cagttgtata tactacctaa atctgcattt 
2 041 agtgtattaa aatgcagcat ttgggattgg gaacataata cgggagttag caggagggac 
2101 taaatcaggt ttcttggttg agttttctta gctccgatta ctgaccgatg atcgtgggtt 
2161 cacgcttcca ttcctttttg taatggggag gggtgtgtgt gtgtctgaat ttttttttaa 
2221 ctgtaaagag aaaggfcgttg gtggatataa caagatgtcg ctttaatcta gtattagaaa 
22 81 acacgatttc ttttactgag aaagagccca ggatttggag ggaaagttgg gaggggagaa 
2341 gcactttgga atatagggtt tattaagccc agcttaggca gattcttgtc aggctgtttg 
24 01 atgatagtgc tggttttggg ggagtggtgg tgggaaggga agcaaacttt taaaaattaa 
2461 acagaaccaa ctcgaattgt atatagctta tttgagaaag gaagatgtta gafcatttgga 
2521 aacttaaaac ctttaacatt ttggttttcg tttgctgaat tctgttcttt ggacagaggc 
2581 accaaaatga ttaattgtat aacttcctgt ttggggcagg ctttctgtta gccctgataa 
15 2641 ttcaaatcgc aaagcagctg actttatagt tacctactgt tgaagtgtaa atgaattaaa 
2701 gtttttaacc ctaacagtgt caaaaactaa aactagaatt ttgattggtt gctgtaccgg 
2761 ttgttaattc cctagccatt caaactcctc cccacgacac tctgaagcag cgacggcaca 
2821 gcgggaagaa tggtaaatct tttaaatttt atttctgtat tataaaggtt tcagtagtca 
2881 taatatgtag aggctgtgta ttggttagtt gcccattgat ccccaggaaa ttctagacca 
2941 tagtacatac agtaggtttg tacctgggat atataccttt tataggcatc atgtgtaatc 
3001 tttgggcaaa attgatgccg ttacatggta tttggtctgt ggtaatgcat gtagttgcag 
3061 tttggcaatt taagctaaac aattagtaac gafcattttta tatcgagcaa gaaaaaatat 
3121 taccgtaatc ttcacatttg atttatttta agatgaaaaa tatttttgtt ggaatfctaag 
3181 tttaatggct aaatatgaag actccttgat ataaaatatt gtgtttaaaa tgtcatgcat 
25 3241 gttatactga aatattctgg agatgtctga agcagttctt taagcagcac tgtgtattat 
33 01 ggaaaaatgt ttgtctttta cctagcatat tfcgtaaafcag acctttaata aaatgtgtgt 

33 61 gtgtgtgtgt gtgtgtgtgt gtgtatatat atataccatt aaaaggtggg ctaatgtggc 
3421 tttccactac agctcttacc agatttttta ttttcagtgt gatttggctg gaatattaat 

34 81 aacagttggt tgacatgtat tcacaattgc atgacttctc tttggaagct ggtaggttga 
30 3541 gtattgttct tttfcatccat cctttcccct cccctaaatg tcagfcctttc tgcttttagg 

3601 attatatttg aattttcagg ggtttataca ccaccactgg agagtagaag gccactaact 
3661 tgcaggcagt taacttatat ctttacaagc tacattctat tttaagagcc tatttagggt 
3 721 aacatttaaa gatttgctta tcactgtatt ctcatccact gaaatcagtc cagctttacc 

3 781 ttggagaaaa agcagatggg aaacaggcaa ggggaaaagc aaaaaagagg aaaaaaaaag 
35 3 841 actggggtca cagtttcgat gaatgttatt ccctgtattt ggtgagtggt gggcactagc 

3901 agctcaattt cactgctagc taatagtggc aaattaagat taaactataa actatttgac 
3961 attactccat tatgtatttt gagtctcatg taattgtttc cagtgatttg aaatgaccat 
4021 aaacttaaat ttgaaggtca gtgttttgac tcttcgaagt aattacactt gacctgctac 
4081 ctaaaacgat gtaatattac aacttttttg aataatctca ggaaaaatgg agaaatgttt 
40 4141 atattcatag ttaataaaca ttctagatag taaccaactg tcatttactg aaaataaaat 

4 2 01 tttacccagg tttattttat ttttgagacg gagtttagct tttgttgccc aggctggagt 
4261 gcagtggtgt gatcttggct cagcgcaacc tccacctccc aggttcaagc aattctcctg 
4321 cctcagcctc caaagtagtt gggattacag gcatattcca ccatgcctgg ctaattttgt 
4381 atttttagta gagacagggt ttcttcatgt tggtcaggct ggtctcgaac tcccgacctc 

45 4441 aggtgatccg cctgcctcgg cctccccaag tgctgtaatc acaggcgtga gccactgcgc 
45 01 ccggcctagc caggttgatt ttaaaattac acttaaaaat aatgtfcctca ttttfcaaggg 
45 61 ctctaatatt tgacttttct taactttcca attatgcagc ccfcatcctgt tccagacgtt 
4621 aaggctttct ttttgaatga aaaactttca gacttttttc tctctctttt tttttttttt 
4681 tggtgtctgc actccccacc gtafcctgtcc cttcctcttt ctcccatttt gagggatatt 

50 4741 ataagggcaa cagttttata gtctcaatat tgagggtaag actggfctaafc agatagagta 
4801 tccaaattga atttattaaa cacagttctg tgtgcatttc cttctgcaaa tctgagaata 
4 861 aagtgattac agtttcatcc taaaatactt taaaaatcag ttggttttag aaataggttt 
4921 ttttcctttt gcatgtaaga afcggacagct atfctttagga aggcatgcac ctgttatccc 
4981 agctactcag gaggctgagg caggagaatt gcttgaaccc gggaggtgga ggttgcagtg 

55 5041 agcctagatc atgccattgc actccagccc gggcgacaga gtgagactct gtttaaaaag 
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5101 aaaaaaataa gtatttgtaa atgtatatat atatatatat acacacacac acacacatat 
5161 atatataaaa tcattatgcc atcagtttga gggagagcat gtatgccaac attgtaaaga 
5221 ctaagtgccc atttatccat aattaagata agcagacttt tccccattgt actataaagt 
5281 tagagtttgg ggctgggtat ggtacttccg cctgtaatcc cagcagtttg ggaggotgga 
5 5341 gcgggcggat cacttgaggc caggagttca agaccagcct ggccaacgag gggaaacccc 
5401 gtctctacta aaaatgcaaa aatcaagtca ggcgtggtgg tgcagcctgt aatccctccc 

54 61 acggaggctg aggaatgaga atcactttga acctgggagg cagaggttgc agtgagctga 
5521 gattgtgcca ctgcgctcca gcttgggcga cagagtgaga ctgtctcaaa aactaagcaa 

55 81 ccagaaacag aaaaagatgg aatggaatta atatgttcca tttagtaaag atggaataga 
10 5 641 attggtccag tttgtgttca ttgtagcata ttcacctatt tggtctgcaa ccattttttt 

5701 ttttttaaag cctaagcata ccctgagaac aacaaccttt tttgattaga acagaagaaa 
5761 aagccagaat gttcttgggc tgccaaaata ttaatacgtt tgtctcaaag acgaaccact 
5821 taggtgcttg cttagagact tctactttac acagaatttg aaaagtttga cagctggcta 
5881 cttactctgc aaaacggaag ggagactttt tagaacagca actggtgatc tggcaaaaat 

15 5 941 gaaaagtaga acttgtgaag aaatgagatg gacttggtgt attaatagct aattttagaa 
6001 gaccctgtgg atattctaaa tcagcataat agtgatgtct gagccatagt gttaaattat 
6 061 gattactgta tttgaagtat aagaaggcag aaaagtgtga tgctgttaga aaaaaatctc 
6121 attfccaaatg attgaaggtt aaaaacaatt ggggaaagat tagagagggc aggcggtgct 
6181 ttaagggagg tggaataata cttggctcct tattctccaa aagtgctgga tagctgaagt 

20 6241 ttttaatatt ctggataatt gatatttgat tattagttta ggagggatag catcttcagt 
63 01 gaacccttgt agctctaagg taggcatatt aaacaactac catactaaag gtggggagta 
6361 cgtttggaga gcactttgct gggccagctg atgafcafcgct taggtgatat ttaagaaaat 
6421 ctgcttcttg tctgaaaaac atttggggct tcagaataat accacatacc tcattcttgg 
6481 tgttttaatt tatttctttg taattgtttt ctgtttctta ataattttaa attaagggta 

25 6541 ttttcccaat ataaacattg ctttttggtt aaataaaatc aaaactagtt ctgtctccat 
6601 atctgaatag accagacaga gaattttctt gcctttcaac agcttaaaaa atgtttttgt 
66 61 tagaactgtt gtgttcaggc tgaaatactt tcaaagtttg ttagttatta ttgagaaatt 
6721 tctgtaataa tcatggaaag ggtaatttaa tagttaaatc tcaacttaat tgaagttatt 
6781 tctgttgtgg aacttatggt catcttagca agaggtcatt gctttatagt cagttctctt 

30 6841 tctcattaaa aatatagtgt actcaccttt acagggtgaa tgttggtaaa taacttctgt 

6 9 01 ctgtgaagga agtattctgg acctgtaagt taaaaataag gtgtttatag caaattatgt 
6961 aaataaagat tgtatattag aaggtacaca ctattcaaat ttaaagaaaa tgtatattga 

7 021 gaaaataact caaattcttc catgaaattg gcaagaagta aacatttcaa atacacaaaa 
7081 cattatgggt atttttgttc attfcgattat aaaatgccaa agttgtttta taaatgctac 

35 7141 ttcaagcctg gaaactttga ggaagtcctg aacattaagc atataaatgg cccagctcta 

72 01 gaatacatgt aagttgaaaa gctaacctga agtgggaagc gcagtatata cctaagactt 
7261 actctgcact gaaagtttgc tttgtcacta gaagtaaaac aagactgtgg taggatagta 

73 21 agatcagtaa cacctcagtt aatcaggtat cttggaggaa gtgaagaaag accctaattc 
73 81 aagggacagt taattggcct tttattccaa gaatgggcct tagtggcagt atcttaaaag 

40 7441 cccacaagat ggagatgttt cctaatgaaa ggcctttaat ttctttatag agctgagtta 
75 01 gtgtcacatc ccagtcccca cccatgaccc ttccccagtt aaaaagaaga gaaaatgttg 
7561 agcaagtctg atttgattcc catggtgaca tttttagcca ttatgfcaaca aattcfcgaca 
7621 gtttaccctt aaaattaaaa acctccagtc ctgtcttttt aaagggtaga aagaaggtta 
7681 ggtataggat agctttttat ttatttattt atttattttt gtacaaaggg agcctatgta 

45 7741 aagctgccag atctgaactt tctggtgttt tgctgtaatg ttagtaagat ttcgccttaa 
7801 aatattttat tttgagtata tacgtttggt cttagagtgt cttggtggat ttccgcttac 
7861 cacGcatcac tttctgcatt ttaaaggctt aaatacttta ttgctggtta actgaggttc 
7921 tactgtaaac gaatcatcta agttaattag tggattgtac ttcaatggat aattttcact 
7981 aaattgttta tattgcacat tacttttgtc ttaaggactc ttagcacatc aaaaaaattg 

50 8 041 gcfccacaact taattgtgag atgtagaatt ttccafcttta tgtgctaaga gttttgttaa 
8101 tgagagaact gttaaaatag aaaaggagct tcagcataga caccaatgct ggttgctgag 
8161 atcagtgggg aactgcatag cattttaaca agtttaatga acatttggaa gagaaatttg 
8221 aagctaagat tctggttttt ttgctgttaa ctttttaatt ttttaatcta aggaaaactt 
82 81 ttatgtacag tatttttact ttgggtatat gtfctatctfct tagcaagttg aaagacttaa 

55 83 41 tttgctgctt gctcactatt ttgtaattat ttgggagcag cagtaataag ccagcttttt 
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8401 ggaataggat gttcctgatg tgtggttatg taggaagaat gatgttttaa tatactgccc 
B461 agtaaactgg tgcagtttgg aaaaggtgtg ttattgatgt ggataatatt taaggcaatt 
8521 ttttttaagc attttaatac tgctttttgt ctttacagga aaatgtttat aggaggcctt 
8581 agctgggaca ctacaaagaa agatcctaag gaaaactttt atgtacagta tttttacttt 
5 8641 gggtatatgt ttatctttta gcaagttgaa agacttaatt tgctgcttgc tcactatttt 
8701 gtaattattt gggagcagca gtaataagcc agctttttgg aataggatgt tcctgatgtg 
8761 tggttatgta ggaagaatga tgttttaata tactgcccag taaactggtg cagtttggaa 
8821 aaggtgtgtt attgatgtgg ataatattta aggcaatttt ttttaagcat tttaatactg 
8 881 ctttttgtct ttacaggaaa atgtttatag gaggccttag ctgggacact acaaagaaag 
10 8 941 atctgaagga ctacttttcc aaatttggtg aagttgtaga ctgcactctg aagttagatc 
9001 ctatcacagg gcgatcaagg ggttttggct ttgtgctatt taaagaafccg gagagtgtag 
9061 ataaggtagt gtgttacgtg ttctgatcag ttaataatat aaaatattaa catatggata 
9121 gtttgataca atgagtttgc ctatttgtgg ttccccattt tgatagtata ggaaggaaga 
9181 atagttcttg ccccaatacg ttttatgaag atagaggtag gttcaggaat tatttcctga 
15 9241 ataatttgtg ttccaggctc tgctaaattt tgaaattaac tttaaagata ctatagactt 
93 01 aaagatgcct agttaaaagg atgtgtttta gcaattcaca gaagtccata ttttgaaatt 
9361 ttgttaggca agcaactttt aactgaatca ttattttgat cctgggctaa agggaagtag 
9421 cagttatgtt tgtatatagt gctaaaggga agtagcagtc atgtttgtat atattgaagg 
9481 taatggttat ctagtaattg gttaaatfctg tgtatgtcct accattctta cctfctagatt 
20 9541 taaacagtat atgttagttg atgttacatc accaccatga cttgacagtt taatcttgag 
9601 caagtcaaca tatgcttggc attatctgta tttatagtta tttttaagta agttaatagt 
9661 ctctgaactt cagttaagat atatattttt taaatgaaaa cttcaatttc cttaggtcat 
9721 ggatcaaaaa gaacataaat tgaatgggaa ggtgattgat cctaaaaggg ccaaagccat 
9781 gaaaacaaaa gagccggtta aaaaaatttt tgttggtggc ctttctccag atacacctga 
25 9841 agagaaaata agggagtact ttggtggttt tggtgaggta tgttataaat gttttgaccc 
9901 agtttatgtc aaaattagtg tgaatgtgat tgtcccatta tggactcaga gtcacttggc 
9961 ttttcaaagc tgttagggta gattatgtga tctgttttgg aataaggata ttgtaaatac 
10 021 ttcattagca ggtctttgaa ggttggataa tgtgtttttc tcattgagca cctactctat 
10081 gcagagtatt gctggggtag agtataccaa agatgaaata gacaaacatt cttaaataca 
30 10141 gacaattaag aggaagaaat ctagattaga aggagacttt tgttgaaaat aggaaaggaa 
10201 ttaagaatag ggtgtagtgc cttatcagtt gaaatgcatg tgtaagtgca aatttaatga 
10261 gactaatcat tatagactca ttagtgaggc tggacgcgat ggctcatgct gtagtaatcc 
10321 cagcacttgg gaggccgaag tgggtggatt actggagacc aggagatgaa gaccatcctg 
10381 gacaacatag tggacccgtc tcaattaaaa gtaaaacaaa cttagactca ttagtgaaat 
35 10441 aggtagataa tagaggtttc ttttatgaat taatgaatta atgaaagttg agaaatttgt 
10501 ggccttgggc tccagatacg tcccacagac attfcattgct tgattgaaac aataagtgct 
10561 tttfctagtgt tgaggatttg agatattgcc cacaaaactc agtatctagc tttttaaaaa 
10621 atatttgcaa gagcacgcaa catgggaact gacgctgccc tcctgtacgg cagcatctct 
10681 caaactgagt agtagcttcc atcttggttt gggcatatgc tctccagttt ttagtagtcc 
10741 tcaccaccct acttcctgtt ttctctcaaa tacatttttt ttctgttttt cttagatttg 
10801 actgttttct tcttgttctt tgtgggcatt tgaatttgtg acccttgagt taggtagtaa 
10861 atgtcagtgc gtggtaaagc ttattfcttgt aaatagttgt gaagacctta gatggaatgg 
10921 gtgttctaat ttgaagaatt ccttaaaagg attagaataa atagggagaa acaggagact 
10981 agaacttcag tgccaaatac atgttttctt tgtgtgtttt cccccctcta aacttgtgtt 
45 11041 tcttttaagg tcaataaaat gcatgttagc atattaaaat ttgtttttta ataccaggtg 
11101 gaatccatag agctccccat ggacaacaag accaataaga ggcgtgggtt ctgctttatt 
11161 acctttaagg aagaagaacc agtgaagaag ataatggaaa agaaatacca caatgttggt 
11221 cttagtaaag taagttaagc atccatttac ttgtagagaa aactagctgh tgtaaagagc 
11281 ttaaccattt atctttctct gtaaaggctt aagttctttg catgctttaa aaacttctca 
50 11341 ttggttactt accattgacc aactttttgt ggggagcagg atggacacat ttgttagtgt 
11401 ttttgtctag gctttcacaa aaatagtttt tagaacttga caagtaaaat gaagtaacat 
11461 cactagcaag tactcataag tgattactct taagtactaa atattggttt aattaataaa 
11521 Gtgactgagg aaagtttcaa attagcctac tctatttaaa catgttggct attctggtta 
11581 ttagaaactt atttagcaac ttttattttc ttgagtcagt fctaataatgt aatttttctc 
55 11641 ttttagtgtg aaataaaagt agccatgtcg aaggaacaat atcagcaaca gcaacagtgg 
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11701 ggatctagag gaggatttgc aggaagagct cgtggaagag gtggtggtaa gctagagcct 
11761 aagtfctactc tafcctfcaagc ttttctgctt tttaattatc ctgaagtaaa gatctttgct 
11821 gatcttctga ctttagtgaa cctattaatg tgctgcaggc cccagtcaaa actggaacca 
11881 gggatatagt aactattgga atcaaggcta tggcaactat ggatataaca gccaaggtta 
5 11941 cggtggttat ggaggatatg actacactgg ttacaacaac tactatggat atggtgatta 
12001 tagcagtaag tactatactt tbtatattaa ctgctatttg acatttattt tgtacaaatt 
12061 tggataggca gaaaggttag tgtagccttg ccaagtgcaa atgtcttcag gtttcaaatt 
12121 cctggaaact tgaaactgca gccattttat tgcfctggfctc ctcccagcct atatcacaca 
12181 cacattataa gggtagggtg tatgtgtggt tctatatatg tttgctgggc atttgtttta 
10 12241 cttggattta aaaattttaa gctcagttca gatcttttaa gctcaaggta ttctaaatgt 
123 01 cacttcttgg accaaccaat aatcttggca tgatgtgtct taatcctata aaactgaata 
12361 ataccacatg fctgccagtta aaactaafcag tatcctcgct ttaggattat taatgtagaa 
12421 actcttaaaa cagatgttga gcttgataga accaaaaaac ttgactttta gacatggaaa 
12481 gccctgactt cattgfcgcaa ctaaggtagt tgctctccac ctgatttgta gcaactgttg 
12541 agtcgtcagg taaagggttc tactagaagc aatcttacat ttttttggag gagagtggtt 
12601 gcafctggttg cattgtttta agtggttttt cttttccttc cttggttaga ccagttcttg 
12661 gagttatatc ctttcttagg tgactaggcc tgctgcacaa taataggtta attaaagtca 
12721 gaagaaggtc agcaaagatg gattgggtga gattggggcc cttttcttag aagggcagag 
12781 atactaagca ctgattgtgg ttgacaattt gttctaaatt ttaagatatt ttttgctggt 
20 12841 ggttgfcgaaa gggtcagctg tccatccttt gaaacttaaa acttttaaac tgtaagggtg 
12901 aggggattgt ctcccatttt atacaataag tcaagtaatc agctcattct gaatgcctgc 
12961 cattgtatgc attcactaca tatttggtaa attatttgat aaafcgattgc tcagggtgaa 
13 021 tttttcacac ttgggaatta agctaccctt aattttttga gattgtttaa aattaggtac 
13081 tgttctgatt attagtafcgt aaccactacc gttctggttc taacacttgt tttattttag 
25 13141 accagcagag tggttatggg aaggtatcca ggcgaggtgg tcatcaaaat agctacaaac 
13201 catactaaat tattccattt gcaacttatc cccaacaggt atgttctaaa aatagttttt 
13261 ttttgtcatt tacaatagta gtttttataa tctatattgt tcataaaaca atgcttaatfc 
13321 taagagtttc acagcaccca gaagtgctta ccatattata acatagtgac tttcaaaaga 
13381 tatgta.acac aggtgctctt aagcttttgc ctttttgtcc tattattaac aagtcagtaa 
13441 agttaacagg taaagtactg ctaatgggta caaatfiaagg aattgcagca aaaaagtatt 
13501 gcctactaac tctgacatta taccttgttt gtaccgccag cgggaacttc attgcaggcc 
13561 ctgtgtcgcg ctgacttcag attctcacag gcccgctcaa tgcggacagg gtaacgagat 
13621 gctccacgct ctcgaatgct gccgtttggt atggtctctt ccaacatcct gtatcagcat 
13681 tataaaataa aatggatact tcaagctttg ccttcactta tttctttgct ttttaaaaac 
35 13741 tatttgtaat gtaattttaa tgcatttttt acaggcccag taatggttaa atacgtcagc 
13801 ttactgaata attttaacta tttattcttc taaggataca gcttgtctct ggattttcca 
13861 gtcttaattt tatattttat taatctattt taatgcttgc ttttcccatt tatagacgtt 
13921 gtagcagtaa ttgcaagaag ttcttgagct gaattcctgt tgtgacaact tcctataatt 
13981 acagtagata actttttctt ttagtcgtat ataacttttc tataacttgt gatggacaag 
40 14041 agatatgctt atccaataaa ataagcttaa atattagatg ctcttgggtc aaaatgtcct 
14101 tttaccaaat tgaccttttt atgagttctt tgggtaaata ctttaaagct ttttatattt 
14161 taaagaatac ttgtaaaagc atatcacatc ttaaaccagt ggtgcacatg tggatttaca 
14221 gctcatggac tctactgttc agctttaatt tataaaacat atcacacatt ' taatgttata 
14281 cagtatttac atatagtgga acafcagggat aactcagttt tatgtaaatt tttgttaagt 
45 14341 gttgtagcct gcccagagtg acttctattt tttcttcttt gtctccaggt ggtgaagcag 
14401 tattttccaa tttgaagatt catttgaagg tggctcctgc cacctgctaa tagcagttca 
14461 aactaaattt tttgtatcaa gtccctgaat ggaagtatga cgttgggtcc ctctgaagtt 
14521 taattctgag ttctcattaa aagaaatttg ctttcattgt tttatttctt aattgctatg 
14581 cttcagaatc aatttgtgfct ttatgccctt tcccccagta ttgtagagca agtcttgtgt 
14641 taaaagccca gtgtgacagt gtcatgatgt agtagtgtct tactggttfct ttaataaatc 
14701 cttttgtata aaaatgtatt ggctctttta tcatcagaat aggaaaaaat tgtcatggat 
14 761 tcaagttatt aaaagcataa gtttggaaga caggcttgcc gaaattgagg acatgattaa 
14821 aattgcagtg aagtfctgaaa tgtttttagc aaaatctaat ttttgccata atgtgtcctc 
14881 cctgtccaaa ttgggaatga cttaatgtca atttgtttgt tggttgtttt aataatactt 
55 14941 ccttatgtag ccattaagat ttatatgaat attttcccca atg 
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SEQ ID NO. 42 

5 NP_852000 

GSK-3 Binding Protein - FRATl 

mpcrreeeee ageeaegeee eedsflllqg 
aspcgppgap Irapgplaaa vpadkarspa 
0 vrgraapycv aelatgpsal splppqadld 
rlqqrrgsqp etrtgdddph rllqqlvlsg 
sapvheppsp rspraacsdp gasgraqlrt 



svalgssgev drlvaqiget Iqldaaqhsp 
vplllppala etvgpappgv Ircalgdrgr 
gppgagkqgi pqplsgpcrr gwlrgaaasr 
nlikeavrrl hsrrlqlrak Ipqrpllgpl 
gdgvlvpgs 
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15 SEQ ID NO. 43 

NM_005479 

Homo sapiens frequently rearranged in advanced T-cell lymphomas 
FRATl) , transcript variant 1, rtiRNA 

20 

1 ggatt ccggc tcccgcggct gcaggcgcgc ggctagagtg cctggcgggc tccggcttcc 
61 gcgtccgccc cggccccggt ccagacttag tcttoagctc cgcgcccgct ccgccgcggc 
121 ccaccgcgcc cgccggcagc cgagccccca gcgacgcccg cacagctccg ggtgcccaga 
181 cagggggcca tgccgtgccg gagggaggag gaagaggaag ccggcgagga ggcggagggg 
25 241 gaggaagagg aggaggacag cttcctccta ctgcagcagt cagtggcgct gggcagctcg 
3 01 ggcgaggtgg accggctggt ggcccagatc ggcgagacgc tgcagctgga cgcggcgcag 
3 61 cacagcccgg cctcgccgtg cgggcccccg ggggcgccgc tgcgggcccc ggggcccctg 
421 gctgcggcgg tgccggcgga caaggccagg tccccggcgg tgccgctgct gctgccgccc 
481 gcgttggcgg agactgtggg cccggcgccc cctggggtcc tgcgctgcgc cctgggggac 
541 cgcggccgcg tgcggggccg cgctgcgccc tactgcgtgg ccgagctcgc cacaggcccc 
6 01 agcgcgctgt ccccactgcc ccctcaggcc gaccttgatg ggcctccggg agctggcaag 
661 cagggcatcc cgcagccgct gtcgggtccg tgccggcgag gatggctccg gggcgccgcc 
721 gcctcccgcc gcctgcagca gcgacgcggg tcccaaccag aaacccgcac aggcgacgac 
781 gacccgcacc ggcttctgca gcagctagtg ctctctggaa acctcatcaa ggaggccgtg 
35 841 cgaaggcttc attcgcgacg gctgcagtta cgtgcaaagc ttccccaacg cccgctcctg 
901 ggacctctgt cggccccggt gcatgaaccc ccttcgcctc gcagccctcg cgcggcctgc 
961 agtgaccctg gcgcctccgg gagggcgcag ctcagaactg gcgacggcgt tcttgtgcct 
1021 ggcagctaac acgcccgggg tggccacagc gccagcctca gactggaggg caaggggttc 
1081 ccttgagggc tgcagttcta ctcaggctgg tggagaactc tggcttttgg aagcgagagt 
1141 aaaaagctaa tgacgaggaa ccgaaaaatc gcgagtgttt cgcgggtaac tggggttgag 
1201 ggccaaaata tttggaatga aggacttggc cctatttaag gcagatttta cagagcgcac 
1261 ctcaaacgta caagtcagta ggactcctta tttggcgtga cccgacctgg ccgcggagcc 
1321 tgcatttcct cgcagcctct cagtgccctc cagccccgcg accatgtggc cacaatccac 
1381 gcttctccgg atcgcggtgc gccggaacca cggaggatga tgccagttac ttgctttacc 
1441 ttttcagggc tggctcctga tccactttgg gggaggagaa catgagtaga taatttcagg 
15 01 gtgcagccca atctgccaga cttaaaaaaa ccatcttgtg tctttggagg tgctgcttaa 
1561 taccaaacat gcggtgccat gaagggaccc tttgggggtt gaataggagt taacccctgc 
1621 gctctctttg caactgtctc tcttctcaga gtggtggggg aaggctgtac gacacgggtg 
1681 gggaaaggag gtgggggcgg ggagtattga atggtggtgg aagggtagag aggcgcggag 
1741 tgaaccGcac gccctgtcta aagtgtattt tcagagccgg cccgcctctc ctcggttcaa 
1801 ggtcactgtt tcctgggcac gcactgggtt gcgggacaga gtagccaggt tctgccggtg 
1861 ctcggagaag agcgcagtgt tttgcaagtg ctggagtctc ctgaggacac gcgcgtcgcc 
1921 gccaccgcgg gtgtgggaaa gcgcggacgt gctgggcggc tgtgcttcgg taggcgacca 
1981 ccgcccctgg ccgcgctccg ggctttcacg gaaactcccg agaccgggcc ctgggttcct 
55 2041 cctctcctac tcggctctgc agtcctactc aagcgggtgg ctctgggatc ctgggggcct 
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2101 gggttggggg ctagggagac gccatgtgat ggacactcca gggacacaca gcctagcaca 
2161 gcagcttata atgggctctc cggggccatt tgcaataaca gctgcaattc cctggataga 
2221 cgagttgatt tcctccctct gcccctcccc cagccatgcc agctggcctt tgtaagtgca 
2281 ggaaaccgag tagaaaatgt gaccctccaa atggagaagc tgcaggcttt gccattgtga 
5 2341 accatggtga agtgcttgga acatactgtt cactcactct aaaggcgctg agactgtgct 
2401 gttgttctcg tttttatagt caatggcttg ttcatcatcc agatgtggct actgacatat 
2461 ctacacttcg caccggagtg tctggaattg tggctatcct gattatagga tttfcaactta 
2521 actgaaatgc ctgctttgaa taaatgtgtt gggttttttg tttggtttta ttttatactt 
2581 gccatcagtg aaaaagatgt acagaacaca tttctctgat ctccataaac atgaaaacac 
10 2641 ttgaaatcfcc aaa 



SEQ XD HO. 44 

15 NM_181355 

Homo sapiens frequently rearranged in advanced T-cell lymphomas 
FRATl) , transcript variant 2, mRNA 

1 ggattccggc tcccgcggct gcaggcgcgc ggctagagtg cctggcgggc tccggcttcc 
20 61 gcgtccgccc cggccccggt ccagacttag tcttcagctc cgcgcccgct ccgccgcggc 

121 ccaccgcgcc cgccggcagc agagccccca gcgacgcccg cacagctccg ggtgcccaga 
181 cagggggcca tgccgtgccg gagggaggag gaagaggaag ccggcgagga ggcggagggg 
241 gaggaagagg aggaggacag cttcctccta ctgcagcagt cagtggcgct gggcagctcg 
3 01 ggcgaggtgg accggctggt ggcccagatc ggcgagacgc tgcagctgga cgcggcgcag 
25 361 cacagcccgg cctcgccgtg cgggcccccg ggggcgccgc tgcgggcccc ggggcccctg 
421 gctgcggcgg tgccggcgga caaggccagg tccccggcgg tgccgctgct gctgccgccc 
481 gcgttggcgg agactgtggg cccggcgccc cctggggt cc tgcgctgcgc cctgggggac 
541 cgcggccgcg tgcggggccg cgctgcgccc tactgcgtgg ccgagctcgc cacaggcccc 
601 agcgcgctgt ccccactgcc ccctcaggcc gaccttgatg ggcctccggg agctggcaag 
30 661 cagggcatcc cgcagccgct gtcgggtccg tgccggcgag gatggctccg gggcgccgcc 
721 gcctcccgcc gcctgcagca gcgacgcggg tcccaaccag aaacccgcac aggcgacgac 
781 gacccgcacc ggcttctgca gcagctagtg ctctctggaa acctcatcaa ggaggccgtg 
841 cgaaggcttc attcgcgacg gctgcagtta cgtgcaaagc ttccccaacg cccgctcctg 
901 ggacctctgt cggccccggt gcatgaaccc ccttcgcctc gcagccctcg cgcggcctgc 
35 961 agtgaccctg gcgcctccgg gagggcgcag ctcagaactg gcgacggcgt tcttgtgcct 
1021 ggcagctaac acgcccgggg tggccacagc gccagcctca gactggaggg caaggggttc 
1081 ccttgagggc tgcagttcta ctcaggctgg tggagaactc tggcttttgg aagcgagagt 
1141 aaaaagctaa tgacgaggaa ccgaaaaatc gcgagtgttt cgcgggtaac tggggttgag 
12 01 ggccaaaata tttggaatga aggacttggc cctatttaag gcagatttta cagagcgcac 
1261 ctcaaacgta caagtcagta ggactcctta tttggcgtga cccgacctgg ccgcggagcc 
1321 tgcatttcct cgcagcctct cagtgccctc cagccccgcg accatgtggc cacaatccac 
1381 gcttctccgg atcgcggtgc gccggaacca cggaggatga tgccagttac ttgctttacc 
1441 ttttcagggc tggctcctga tccactttgg gggaggagaa catgagtaga taatttcagg 
15 01 gtgcagccca atctgccaga cttaaaaaaa ccatcttgtg tctttggagg tgctgcttaa 
45 1561 taccaaacat gcggtgccat gaagggaccc tttgggggtt gaataggagt taacccctgc 
1621 gctctctttg caactgtctc tcttctcaga gtggtggggg aaggctgtac gacacgggtg 
1681 gggaaaggag gtgggggcgg ggagtattga atggtggtgg aagggtagag aggcgcggag 
1741 tgaacGccac gccctgtcta aagtgtattt tcagagccgg cccgcctctc ctcggttcaa 
18 01 ggtcactgtt tcctgggcac gcactgggtt gcgggacaga gtagccaggt tctgccggtg 
50 1861 ctcggagaag agcgcagtgt tfctgcaagtg ctggagtctc ctgaggacac gcgcgtcgcc 
1921 gccaccgcgg gtgtgggaaa gcgcggacgt gctgggcggc tgtgcttcgt caatggcttg 
1981 ttcatcatcc agatgtggct actgacatat ctacacttcg caccggagtg tctggaattg 
2041 tggctatcct gattatagga ttttaactta actgaaatgc ctgctttgaa taaatgtgtt 
2101 gggttttttg tttggtttta ttttatactt gccatcagtg aaaaagatgt acagaacaca 
55 2161 tttctctgat ctccataaac atgaaaacac ttgaaatctc aaa 
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SEQ ID NO. 45 

NP_444254 

myosin light chain isform kinase 

mgdvklvass hisktslsvd psrvdstnplt 
epqvtwhrng qpitsggrfl Idcgirgtfs 
Itvegsfakq Igqpwsktl gdrfsasave 
sckitgrpqp qvtwlkgnvp Iqpsarvsvs 
kasmsaelsi qgldsaxirsf vretkatnsd 
qrggsppwaa nsqpqppres klesckdspr 
Igvlspsgee rkrpapprpa tfptrqpglg 
qsqevkenqt vkfrceglav mevapsfssv 
ngqpiqyars tceagvaelh iqdalpedhg 
eyllpvapsk ptapiflqgl sdlkvmdgsq 
hfeqrgtqhs Iciqevfped tgtytceawn 
vtaslgqsvl iscaiagdpf ptvhwlrdgk 
gqyeillknr vgecscqvsl mlqnssaral 
rpgwpargqg wleeedgedv rgvlkrrvet 
Iseddlkeip aeqmdfranl qrqvkpktvs 
kvpppkpatp dfrsvlggkk klpaengsss 
kpaetlkpmg nakpaetlkp mgnakpdenl 
rsesqgtapa fkqklqdvhv aegkklllqc 
gslcsvsiek alpedrglyk cvakndagqa 
Ippvlgtesd atvkkkpapk tppkaaiuppq 
twmJcfrkqiq esehmkvens engskltila 
kpdppagtpc asdirssslt Iswygssydg 
nvqdllpdhe ykfrvrainv ygtsepsqes 
rtvtinteqk vsdfydieer Igsgkfgqvf 
isimnclhhp klvqcvdafe ekanivtnvle 
segveyihkq givhldlkpe nimcvnktgt 
pevinyepig yatdmwsigv icyilvsgls 
ddakd£isnl Ikkdmknrld ctqclqhpwl 
navraigrls sraamisglsg rksstgspts 
yfsktirdle vvegsaarfd ckiegypdpe 
sdvcgdddak ytckavnslg eatctaeliv 



2 

eapafilppr nlcikegata kfegrvrgyp 
Ivihavheed rgkytceatn gsgarqvtve 
trpsiwgecp pkfatklgrv wkegqmgrf 
ekngmqvlei hgvnqddvgv ytclwngsg 
vrkevtnvis keskldslea aakskncssp 
tapqtpvlqk tsssitlqaa rvqpeprapg 
sqdwskaan rripmegqrd safpkfeskp 
Ikdcaviegq dfvlqcsvrg tpvpritwll 
tytclaenal gqvscsawvt vhekkssrks 
vtmtvqvsgn pppeviwlhn gneigesedf 
sagevrtqav Itvqephdgt qpwfiskprs 
alckdtghfe vlqnedvftl vlkkvqpwha 
prgrepasce dlcgggvgad gggsdrygsl 
rqhteeairq qeveqldfrd llgkkvstkt 
eeerkvhspq qvdfrsvlak kgtsktpvpe 
aetlnakave sskplsnaqp sgplkpvgna 
ksaskeelkk dvkndvnckr ghagttdnek 
qvssdppati iwtlngktlk ttkfiiieqe 
ecscqvtvdd apasentkap emksrrpkss 
iiqfpedqkv ragesvelfg kvtgtqpitc 
arqehcgcyt llvenklgsr qaqvnltwd 
gsavqsysie iwdsanktwk elatcrstsf 
elttvgekpe epkdevevsd ddekepevdy 
rlvekktrkv wagkffkays akekenirqe 
ivsggelfer iidedfelte recikymrqi 
riklidfgla rrlenagslk vlfgtpefva 
pfmgdndnet lanvtsatwd fddeafdeis 
mkdtknmeak klskdrrakky marrkwqktg 
plnaeklese edvsqaflea vaeekphvkp 
wwfkddqsi resrhfqidy dedgncslii 
etmeegegeg eeeee 



SEQ ID NO. 4 6 
AF069601 

Homo sapiens myosin light chain kinase isoform 2 (MLCK) mRNA, 
complete cds 



ccggctgcct 
acgcctttcc 

tgggggatgt 

cctcaagagt 
acctctgcat 
agccccaggt 
tggattgcgg 
Sgggsiaagfca 
tgacagtaga 
gggatagatt 



ctgctgcagt 
tgtcccactg 
gaagctggtt 
tgactccatg 
caaagaagga 
gacatggcac 
catccggggg 
tacctgfcgaa 
aggaagtttt 
ttcagcttca 



tcagagcaac 
gcccagttgc 
gcctcgtcac 
cccctgacag 
gccaccgcca 
agaaacgggc 
actttcagcc 
gccaccaatg 
gcgaagcagc 
gcagtggaga 



ttcaggagct 
cacaacaaac 
acatttccaa 

aggcccctgc 
agttcgaagg 
aacccatcac 
ttgtgattca 
gcagtggtgc 
ttggtcagcG 
Gccgtcctag 



tcccagccga 
aacagagaag 
aacctccctc 
tttcattttg 
gcgggtccgg 
cagcgggggc 
tgctgtccat 
tcgccaggtg 
tgttgtttcc 
catctggggg 



gagcttcagg 
acggtgacca 
agtgtggatc 
ccccctcgga 
ggfctacccag 
cgcttcctgc 
gaggaggaca 
acagtggagt 
aaaaccttag 
gagtgcccac 
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caaagtttgc taccaagctg ggccgagttg 
cctgcaagat cactggccgg ccccaaccgc 
tgcagccgag tgcccgtgtg tctgtgtctg 
atggagtcaa ccaagatgac gtgggagtgt 
5 aggcctcgat gtcagctgaa ctttccatcc 
tgagagaaac aaaagccacc aattcagatg 
^ggagtcgaa gctggacagt ctggaggctg 
agagaggtgg ctccccaccc tgggctgcaa 
agctggagtc atgcaaggac tcgcccagaa 
10 cttccagctc catcaccctg caggccgcaa 
tgggggtcct atcaccttct ggagaagaga 
ccttccccac caggcagcct ggcctgggga 
ggagaatccc catggagggc cagagggatt 
aaagccagga ggtcaaggaa aatcaaactg 
15 tggaggtggc cccctccttc tccagtgtcc 
attttgtgct gcagtgctcc gtacggggga 
atgggcagcc catccagtac gctcgctcca 
tccaggatgc cctgccggag gaccatggca 
ggcaggtgtc ctgcagcgcc tgggtcaccg 

20 agtaccttct gcctgtggct cccagcaagc 
ctgatctcaa agtcatggat ggaagccagg 
caccccctga agtcatctgg ctgcacaatg 
actttgaaca gagaggaact cagcacagcc 
cgggcacgta cacctgcgag gcctggaaca 

25 tcacggtaca agagcctcac gatggcaccc 
tgacagcctc cctgggccag agtgtcctca 
ctaccgtgca ctggctcaga gatggcaaag 
tgcttcagaa tgaggacgtg ttcaccctgg 
gccagtatga gatcctgctc aagaaccggg 

30 tgctacagaa cagcfcctgcc agagcccttc 
acctctgtgg tggaggagtt ggtgctgatg 
ggcctggctg gccagcaaga gggcagggtt 
gaggggtgct gaagaggcgc gtggagacga 
aggaggtgga gcagctggac ttccgagacc 

35 tatcggaaga cgacctgaag gagatcccgg 
agcggcaagt gaagccaaag actgtgtctg 
aggtcgattt tcgctctgtc ctggccaaga 
^ggtgccacc gccaaaacct gccaccccgg 
aattaccagc agagaatggc agcagcagtg 

40 gttccaagcc cctgagcaat gcacagcctt 
agcctgctga gaccctgaag ccaatgggca 
tgggcaatgc caagcctgat gagaacctga 
acgttaagaa tgatgtgaac tgcaagagag 
gatcagagag ccaggggaca gccccagcct 

45 cagagggcaa gaagctgctg ctccagtgcc 
tctggacgct gaatggaaag accctcaaga 
gctcactctg ctccgtctcc atcgagaagg 
gtgtagccaa gaatgacgct ggccaggcgg 
ctccagccag tgagaacacc aaggccccag 

50 ttcctcccgt gctaggaact gagagtgatg 
cacctccgaa ggcagcaatg ccccctcaga 
gcgcaggaga gtcagtggag ctgtttggca 
cctggatgaa gttccgaaag cagatccagg 
agaatggcag caagctcacc atcctggccg 

55 fcgctggtgga gaacaagctg ggcagcaggc 



-36- 

tggtcaaaga aggacagatg ggacgattct 
aggtcacctg gctcaaggga aatgttccac 
agaagaacgg catgcaggtt ctggaaatcc 
acacgtgcct ggtggtgaac gggtcgggga 
aaggtttgga cagtgccaat aggtcatttg 
tcaggaaaga ggtgaccaat gtaatctcaa 
cagccaaaag caagaactgc tccagccccc 
acagccagcG tcagccccca agggagtcca 
cggccccgca gaccccggtc cttcagaaga 
gagttcagcc ggaaccaaga gcaccaggcc 
ggaagaggcc agctcctccc cgtccagcca 
gccaagatgt tgtgagcaag gctgctaaca 
cagcattccc caaatttgag agcaagcccc 
tcaagttcag atgtgaaggg cttgccgtga 
tgaaggactg cgctgttatt gagggccagg 
ccccagtgcc ccggatcact tggctgctga 
cctgcgaggc cggcgtggct gagctccaca 
cctacacctg cctagctgag aatgccttgg 
tccatgaaaa gaagagtagc aggaagagtg 
ccactgcacc catcttcctg cagggcctct 
tcactatgac tgtccaagtg tcagggaatc 
ggaatgagat ccaagagtca gaggacttcc 
tttggatcca ggaagtgttc ccggaggaca 
gcgctggaga ggtccgcacc caggccgtgc 
agccctggtt catcagtaag cctcgctcag 
tctcctgcgc catagctggt gacccctttc 
ccctctgcaa agacactggc cacttcgagg 
ttctaaagaa ggtgcagccc tggcatgccg 
ttggcgaatg cagttgccag gtgtcactga 
cacgggggag ggagcctgcc agctgcgagg 
gtggtggtag tgaccgctat gggtccctga 
ggctagagga ggaagacggc gaggacgtgc 
ggcagcacac tgaggaggcg atccgccagc 
tcctggggaa gaaggtgagt acaaagaccc 
ccgagcagat ggatttccgt gccaacctgc 
aggaagagag gaaggtgcac agcccccagc 
aggggacttc caagaccccc gtgcctgaga 
attttcgctc agtgctgggt ggcaagaaga 
ccgagaccct gaatgccaag gcagtggaga 
cagggccctt gaaacccgtg ggcaacgcca 
acgccaagcc tgccgagacc ctgaagccca 
aatccgctag caaagaagaa ctcaagaaag 
gccatgcagg gaccacagat aatgaaaaga 
tcaagcagaa gctgcaagat gttcatgtgg 
aggtgtcttc tgacccccca gccaccatca 
Gcaccaagtt catcatcctc tcccaggaag 
cactgcctga ggacagaggc ttatacaagt 
agtgctcctg ccaagtcacc gtggatgatg 
agatgaaatc ccggaggccc aagagctctc 
cgactgtgaa aaagaaacct gcccccaaga 
tcatccagtt ccctgaggac cagaaggtac 
aagtgacagg cactcagccc atcacctgta 
aaagcgagca catgaaggtg gagaacagcg 
cgcgccagga gcactgcggc tgctacacac 
aggcccaggt caacctcact gtcgtggata 
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agccagaccc cccagctggc acaccttgtg cctctgacat tcggagctcc tcactgaccc 
tgtcctggta tggctcctca fcafcgatgggg gcagtgctgt acagtcctac agcatcgaga 
tctgggactc agccaacaag acgtggaagg aactagccac atgccgcagc acctctttca 
acgtccagga cctgctgcct gaccacgaat ataagttccg tgtacgtgca atcaacgtgt 
atggaaccag tgagccaagc caggagtctg aactcacaac ggtaggagag aaacctgaag 
agccgaagga tgaagtggag gtgtcagatg atgatgagaa ggagcccgag gttgattacc 
ggacagtgac aatcaatact gaacaaaaag tatctgactt ctacgacatt gaggagagat 
taggatctgg gaaatttgga caggtctttc gacttgtaga aaagaaaact cgaaaagtct 
gggcagggaa gttcttcaag gcatattcag caaaagagaa agagaatatc cggcaggaga 
ttagcatcat gaactgcctc caccacccta agctggtcca gtgtgtggat gcctttgaag 
aaaaggccaa catcgtcatg gtcctggaga tcgtgtcagg aggggagctg tttgagcgca 
tcattgacga ggactttgag ctgacggagc gtgagtgcat caagtacatg cggcagatct 
cggagggagt ggagtacatc cacaagcagg gcatcgtgca cctggacctc aagccggaga 
acatcatgtg tgtcaacaag acgggcacca ggatcaagct catcgacttt ggtctggcca 
15 ggaggctgga gaatgcgggg tctctgaagg tcctctttgg caccccagaa tttgtggcfcc 
ctgaagtgat caactatgag Gccatcggct acgccacaga catgtggagc atcggggtca 
tctgctacat cctagtcagt ggcctttccc ccttcatggg agacaacgat aacgaaacct 
tggccaacgt tacctcagcc acctgggact tcgacgacga ggcattcgat gagatctccg 
acgatgccaa ggatttcatc agcaatctgc tgaagaaaga tatgaaaaac cgcctggact 
gcacgcagtg ccttcagcat ccatggctaa tgaaagatac caagaacatg gaggccaaga 
aactctccaa ggaccggatg aagaagtaca tggcaagaag gaaatggcag aaaacgggca 
atgctgtgag agccattgga agactgtcct ctatggcaat gatctcaggg cfccagtggca 
ggaaatcctc aacagggtca ccaaccagcc cgctcaatgc agaaaaacta gaatctgaag 
aagatgtgtc ccaagctttc cttgaggctg ttgctgagga aaagcctcat gtaaaaccct 
25 atttctctaa gaccattcgc gatttagaag ttgtggaggg aagtgctgct agatttgact 
gcaagattga aggataccca gaccccgagg ttgtctggtt caaagatgac cagtcaafcca 
gggagtcccg ccacttccag atagactacg atgaggacgg gaactgctct ttaattatta 
gtgatgtttg cggggatgac gatgccaagt acacctgcaa ggctgtcaac agtcttggag 
aagccacctg cacagcagag ctcattgtgg aaacgatgga ggaaggtgaa ggggaagggg 
aagaggaaga agagtgaaac aaagccagag aaaagcagtt tctaagtcat attaaaagga 
Gtatttctct caaaatcca 



20 



30 



SEQ ID NO. 47 

35 A?IH07433 and P09493 

tropomyosin 1 alpha chain 



40 



1 mdaikkkmqm Ikldkenald raeqaeadkk aaedrskqle delvslqkka kgtedeldky 
61 sealkdaqek lelaekkatd aeadvaslnr riqlveeeld raqerlatal qkleeaekaa 
121 desergmkvi esraqkdeek meiqeiqlke akhiaedadr kyeevarklv iiesdlerae 
181 eraelseggv rqleeqlrim dqtlkalmaa edkysqkedr yeeeikvlsd klkeaetrae 
241 faersvtkle ksiddledel yaqklkykai sesiakialnd mtsm 



45 SEQ ID NO. 48 

NM_000366 and BC007433 

Homo sapiens tropomyosin 1 (alpha) , mRNA (cDNA clone) , complete cds 

50 1 gaggaatgcg gtcgccccct tgggaaagta catatctggg agaagcaggc ggctccgcgc 

61 tcgcactccc gctcctccgc ccgaccgcgc get cgccccg ccgctcctgc tgcagcccca 
121 gggcccctcg ccgccgccac catggacgcc atcaagaaga agatgcagat gctgaagctc 
181 gacaaggaga acgccttgga tcgagctgag caggcggagg ccgacaagaa ggcggcggaa 
241 gacaggagca agcagctgga agatgagctg gtgtcactgc aaaagaaact caagggcacc 

55 301 gaagatgaac tggacaaata ttctgaggct ctcaaagatg cccaggagaa gctggagctg 
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361 gcagagaaaa aggccaccga tgctgaagcc gacgtagctt ctctgaacag acgcatccag 
421 ctggttgagg aagagttgga fccgtgcccag gagcgtctgg caacagcttt gcagaagctg 
481 gaggaagctg agaaggcagc agatgagagt gagagaggca tgaaagtcat tgagagtcga 
541 gcccaaaaag atgaagaaaa aatggaaatt caggagatcc aactgaaaga ggcaaagcac 

6 01 attgctgaag atgccgaccg caaatatgaa gaggtggccc gtaagctggt catcattgag 
661 agcgacctgg aacgtgcaga ggagcgggct gagctctcag aaggccaagt ccgacagctg 
721 gaagaacaat taagaataat ggatcagacc ttgaaagcat taatggctgc agaggataag 

7 81 tactcgcaga aggaagacag atatgaggaa gagatcaagg tcctttccga caagctgaag 
841 gaggctgaga ctcgggctga gtttgcggag aggtcagtaa ctaaattgga gaaaagcatt 
9 01 gatgacttag aagacgagct gtacgctcag aaactgaagt acaaagccat cagcgaggag 
961 ctggaccacg ctctcaacga tatgacttcc atgtaaacgt tcatccactc tgcctgctta 

1021 caccctgccc tcatgctaat ataagtttct ttgcttcact tcfccccaaga ctccctcgtc 
1081 gagctggatg tcccacctct ctgagctctg catttgtcta ttctccagct gaccctggtt 
1141 ctctctctta gcatcctgcc ttagagccag gcacacactg tgctttctat tgtacagaag 
12 01 ctcttcgttt cagtgtcaaa taaacactgt gtaagctaaa aaaaaaaaaa aaaaaa 



SEQ XD KO. 49 

20 EITAIiAPSTMK 

SEQ XD NO. 50 

25 MIiTBLEK 

SEQ ID NO. 51 

30 AusrsiiDvyHK 



SEQ ID NO- 52 

35 GADVWPK 
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